Energy-aware hardware optimizations make 2D hierarchical clustering fast and practical for low-power embedded and edge devices.

Why Energy Constraints Are Redefining On-Device Machine Learning
In recent years, the need to bring machine learning algorithms directly onto hardware has prompted a deep reflection on how to balance accuracy, speed, and energy consumption. Within this context, work has been carried out to make two-dimensional clustering feasible even in scenarios that are strongly constrained in terms of resources, such as sensor networks and low-power embedded systems.
A 2D Hierarchical Fuzzy Clustering Method as the Starting Point
The starting point is a two-dimensional hierarchical clustering algorithm based on a spatial representation of data and fuzzy membership functions. This approach makes it possible to identify complex data structures, even in the presence of overlapping clusters or nontrivial shapes, but it typically comes with a high computational cost that is poorly suited to energy-limited devices.
Hardware-Driven Simplifications for Parallel and Low-Precision Execution
To overcome this limitation, the focus shifted to a hardware-oriented implementation of the algorithm. Targeted mathematical simplifications are introduced, designed not to alter the behavior of the method but to make it more suitable for parallel execution and low-precision arithmetic. In particular, some computationally expensive operations are replaced with lighter approximations, such as the use of less costly distance measures and the reformulation of scaling factors to avoid complex multiplications and divisions.
A Parallel Fixed-Point Architecture for Efficient Grid Evaluation
The resulting architecture is designed to exploit the intrinsic parallelism of digital hardware, enabling the simultaneous evaluation of membership functions over multiple points of the two-dimensional grid. The adoption of fixed-point arithmetic with a limited number of bits helps to reduce energy consumption while preserving sufficient precision to maintain clustering quality.
Validating Accuracy While Gaining Speed and Energy Efficiency
System validation is performed by comparing the results of this hardware implementation with those of the original theoretical model. Tests on several two-dimensional datasets show that the introduced approximations do not compromise cluster identification, while the gains in speed and energy efficiency are significant.
Toward Distributed Unsupervised Intelligence at the Edge
The outcome is a concrete demonstration of how unsupervised learning algorithms can be adapted to real-world contexts where energy and computational resources are limited. It represents an important step toward integrating advanced data analysis techniques directly into devices, enabling distributed, efficient, and truly operational intelligence in the field.
Contatti
-
E-Mail:
-
Telefono:
Link utili
Autori
G. C. Cardarilli, R. Fazzolari, M. Matta, M. Panella, A. Rosato, S. Spanò
Agosto 6, 2020









