Parallel array histogram architecture for embedded implementations

Electronics Letters


Proposed is a parallel array histogram architecture (PAHA) suitable for embedded implementations. The PAHA uses a register array instead of a memory block to store the histogram bins. In each step, M inputs can be processed in parallel to update the histogram bins without any additional latency. Also described is a second version of the PAHA with a flexible number of inputs, potentially avoiding the need for multiple PAHAs in a single application. Implementation results show that the architecture can achieve a super-linear speed-up of 43.75× for a 16-way PAHA when compared to a software implementation in a general-purpose processor.
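The update rule summarised above can be illustrated with a small behavioural model. This is only a sketch under stated assumptions, not the authors' hardware design: the abstract does not say how the per-bin increments are computed, so the model simply counts how many of the M inputs in a group fall into each bin and adds that count to the bin in one modelled step. The names `paha_step` and `paha_histogram` are hypothetical. The last group may hold fewer than M inputs, which loosely mirrors the flexible-input variant.

```python
from typing import List, Sequence, Tuple


def paha_step(bins: List[int], inputs: Sequence[int]) -> None:
    """Model one step: apply a group of up to M inputs to the bin array.

    Assumption: each bin receives a single combined increment per step,
    equal to the number of inputs in the group that match its index.
    """
    increments = [0] * len(bins)
    for value in inputs:
        increments[value] += 1  # repeated values in a group are summed
    for index, inc in enumerate(increments):
        bins[index] += inc      # every bin is updated once per step


def paha_histogram(data: Sequence[int], num_bins: int, m: int) -> Tuple[List[int], int]:
    """Build a histogram by consuming `data` in groups of M values.

    Returns the bin counts and the number of modelled steps taken.
    """
    bins = [0] * num_bins
    steps = 0
    for start in range(0, len(data), m):
        paha_step(bins, data[start:start + m])  # final group may be short
        steps += 1
    return bins, steps


if __name__ == "__main__":
    counts, steps = paha_histogram([0, 1, 1, 3, 3, 3, 2, 0], num_bins=4, m=4)
    print(counts, steps)
```

In this model the step count falls from N for a one-input-per-step loop to ceil(N / M), which is the source of the parallelism; the speed-up reported in the abstract comes from the authors' implementation results and is not reproduced by this sketch.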


