Jump to content

Efficient Software Implementation of Deep Learning Architectures on Embedded Systems

Fast facts

  • Internal authorship

  • Publishment

    • 2025
  • Type of research service

    Scientific lecture

  • Organizational unit

  • Subjects

    • Electrical engineering in general
  • Publication format

    Other research performance

Quote

T. Fei, Efficient Software Implementation of Deep Learning Architectures on Embedded Systems. 2025 [Online]. Available: https://stup.ferit.hr/2025/03/10/gp-efficient-software-implementation-of-deep-learning-architectures-on-embedded-systems/?utm_source=rss&utm_medium=rss&utm_campaign=gp-efficient-software-implementation-of-deep-learning-architectures-on-embedded-systems

Content

This scientific talk discusses methods for optimizing deep learning architectures on embedded systems. It highlights key challenges, such as limited processing power, memory constraints, and real-time performance requirements. Model compression techniques, including quantization, pruning, knowledge distillation, and weight sharing, are explored to reduce memory usage and computational complexity. Hardware-software co-design is emphasized, leveraging specialised accelerators like NPUs, GPUs, and FPGAs to improve efficiency. Additionally, software optimization techniques, demonstrated through a radar-based hand gesture recognition project, showcase how deep learning can be effectively deployed on edge devices while balancing accuracy, performance, and resource constraints.

Notes and references

This site uses cookies to ensure the functionality of the website and to collect statistical data. You can object to the statistical collection via the data protection settings (opt-out).

Settings(Opens in a new tab)