• Journal of Semiconductors
  • Vol. 41, Issue 2, 022402 (2020)
Junyong Deng1, Lin Jiang2, Yun Zhu1, Xiaoyan Xie3, Xinchuang Liu1, Feilong He3, Shuang Song4, and L. K. John4
Author Affiliations
  • 1School of Electronic Engineering, Xi’an University of Posts & Telecommunications, Xi’an 710121, China
  • 2School of Communication and Information Engineering, Xi’an University of Science and Technology, Xi’an 710054, China
  • 3School of Computer, Xi’an University of Posts & Telecommunications, Xi’an 710121, China
  • 4The University of Texas at Austin, TX 78712, USA
  • show less
    DOI: 10.1088/1674-4926/41/2/022402 Cite this Article
    Junyong Deng, Lin Jiang, Yun Zhu, Xiaoyan Xie, Xinchuang Liu, Feilong He, Shuang Song, L. K. John. HRM: H-tree based reconfiguration mechanism in reconfigurable homogeneous PE array[J]. Journal of Semiconductors, 2020, 41(2): 022402 Copy Citation Text show less
    (Color online) The topology of HRM. (a) Unicast. (b) Multicast/broadcast.
    Fig. 1. (Color online) The topology of HRM. (a) Unicast. (b) Multicast/broadcast.
    (Color online) Homogeneous thin-core PE array. (a) Inter –cluster communicate. (b) PE cluster. (c) Micro-architecture of PE.
    Fig. 2. (Color online) Homogeneous thin-core PE array. (a) Inter –cluster communicate. (b) PE cluster. (c) Micro-architecture of PE.
    (Color online) Distributed shared memory structure.
    Fig. 3. (Color online) Distributed shared memory structure.
    (Color online) The prototype system of the proposed approach.
    Fig. 4. (Color online) The prototype system of the proposed approach.
    (Color online) Organizations of video processing system. (a) Original organization of video processing system. (b) Video processing system with higher parallelism.
    Fig. 5. (Color online) Organizations of video processing system. (a) Original organization of video processing system. (b) Video processing system with higher parallelism.
    Results of different sequences in video processing. (a) Salesman. (b) Bridge.
    Fig. 6. Results of different sequences in video processing. (a) Salesman. (b) Bridge.
    Mapping of viewport transportation.
    Fig. 7. Mapping of viewport transportation.
    (Color online) Rendering scene by proposed scheme.
    Fig. 8. (Color online) Rendering scene by proposed scheme.
    Data-driven mode mapping of six-tap filter. (a) Six-tap filter. (b) Data-driven mode mapping.
    Fig. 9. Data-driven mode mapping of six-tap filter. (a) Six-tap filter. (b) Data-driven mode mapping.
    ComponentParameterThis paperRSF
    PEBit-width of regs in a PE1616
    # of regs in a PE164
    # of PEs1616
    CM(4 kB)/IM(1 kB)Bit-width of a CM/IM3232
    # of CMs/IMs1616
    CB(1.5 kB)/DRAM(512 B)# of sets12
    # of banks in a set13
    Bit-width of a bank1632
    Table 1. PE cluster RTL implementation.
    FU changingPEOperation instructionCall instruction
    DCT→Intra16→228568
    DB→Intra15→820879
    Intra→DB9→1225848
    IME→DCT11→1631367
    FME→IME2→113916
    FME202
    MC505
    Table 2. The statistics of reconfiguration instructions.
    Comparison aspectPE clusterHRM
    This paperRSFHReAComparison
    Process (nm)90906590
    Area (gate equivalent)4155696658649136.9%↓4331
    Critical delay (ns)2.803.743.5725.1%↓, 21.6%↓2.02
    Power (mW)1.960.8145%↑0.48
    Table 3. Synthesis results of PE cluster.
    Junyong Deng, Lin Jiang, Yun Zhu, Xiaoyan Xie, Xinchuang Liu, Feilong He, Shuang Song, L. K. John. HRM: H-tree based reconfiguration mechanism in reconfigurable homogeneous PE array[J]. Journal of Semiconductors, 2020, 41(2): 022402
    Download Citation