Passa al contenuto principale

Perceptron Theory Can Predict the Accuracy of Neural Networks

A perceptron-based theory predicts neural network accuracy using output statistics, fast, data-free, and surprisingly precise.

From Black Boxes to Predictable Models

In recent years, deep neural networks have become the invisible engine behind much of today’s digital innovation, from image processing to autonomous driving. Yet, despite their effectiveness, they largely remain “black boxes”: powerful, but opaque. What if we could predict a model’s accuracy before fully training or testing it?

A Classic Theory Reimagined for Modern AI

This is where a revolutionary idea comes into play: reviving perceptron theory, one of the simplest structures in artificial intelligence, to predict the performance of complex neural networks. This modern statistical formulation, designed to operate on the output layer of any architecture, from recurrent networks to deep CNNs, allows for highly accurate estimates of a model’s classification performance. The prediction relies only on a few statistical moments (such as mean and variance) of the postsynaptic sums, without needing to train any additional models.

High Accuracy Proven Across Models and Datasets

The results are impressive: this approach has been validated on over 120 real-world classification datasets and around 15 pretrained deep networks on ImageNet. In the latter case, the correlation between predicted and actual accuracy reaches 93%, with peaks of 97% using refined statistical approximations. Even in complex networks like ResNet, VGG, or NASNet, the method captures the model’s behavior with striking fidelity.

Scalable, Lightweight, and Data-Agnostic

Beyond precision, this model is both scalable and efficient: it can be applied to networks with thousands of output classes, reducing the entire evaluation process to a fast and non-invasive statistical analysis. In scenarios where data access is restricted (e.g., due to privacy), a variant of the method can estimate performance using only the final layer weights, still achieving notable predictive power.

A Step Toward More Transparent AI

The potential applications are wide-ranging: from selecting the best model in resource-constrained environments to building tools for explaining network decisions, and even detecting adversarial examples or outliers through the statistical behavior of activations.

Old Foundations for a New AI Era

In an era dominated by ever-larger and more complex models, rediscovering the simplicity and predictive power of a theory born over sixty years ago, and adapting it to modern AI, is not just an elegant intellectual move. It’s a decisive step toward more interpretable, trustworthy, and explainable neural networks.

Autori

D. Kleyko, A. Rosato, E. Paxon Frady, M. Panella, F. T. Sommer
Febbraio 6, 2023

Consigliati

Consigliati

Altri articoli da leggere
Renewable energy

A Review of the Enabling Methodologies for Knowledge Discovery from Smart Grids Data

A KDD-driven pipeline turns smart meter streams into multi-step load forecasts, benchmarking feature reduction and models.
Biomedical

Enhancing Autism Detection Through Gaze Analysis Using Eye Tracking Sensors and Data Attribution with Distillation in Deep Neural Networks

A deep learning model enhances early autism diagnosis by analyzing visual patterns with eye tracking.
Quantum computing

Quantum Generative Modeling via Straightforward State Preparation

A lightweight quantum generative model creates high-fidelity data samples with minimal parameters and efficient state preparation.
Quantum computing

Enhancing QAOA Ansatz via Multi-Parameterized Layer and Blockwise Optimization

A novel quantum-classical algorithm boosts QAOA performance with fewer layers, enabling real-world optimization on NISQ devices.
Renewable energy

A Deep Learning-based Approach for Battery Life Classification

A deep learning-based LSTM network accurately classifies battery health, optimizing energy storage and predictive maintenance.
Biomedical

An explainable fast deep neural network for emotion recognition

A fast, explainable deep neural network enhances emotion recognition by optimizing facial landmark analysis.
Renewable energy

Multi-label classification with imbalanced classes by fuzzy deep neural networks

A fuzzy deep neural network accurately classifies household appliances in real time using symbolic data and multi-label AI.
Quantum computing

Quantum enhanced knowledge distillation

Classical-to-quantum knowledge distillation boosts hybrid AI performance using efficient quantum circuits and reduced model sizes.
Quantum computing

A variational approach to quantum gated recurrent units

A faster and efficient Quantum Gated Recurrent Unit (QGRU) improves time series forecasting.
Aerospace

A Neural Network Symbolic Approach to Structural Health Monitoring in Aerospace Applications

A symbolic deep learning approach enhances structural health monitoring in aerospace achieving near-perfect damage classification.

Hai un'esigenza
specifica? 

Compila il form e parlaci del tuo progetto.
Ti proponiamo la soluzione più adatta al tuo contesto.
Impossibile salvare l'abbonamento. Riprova.
Grazie per aver inviato il modulo.

Hai un'esigenza
specifica? 

Compila il form e parlaci del tuo progetto.
Ti proponiamo la soluzione più adatta al tuo contesto.
GRID+ Copyright © 2026. All Rights Reserved.
P. IVA 17387741006 | Il capitale è stato interamente versato 10.000€ | RM – 1715269
GRID+ Copyright © 2026. All Rights Reserved.
P. IVA 17387741006 · Il capitale è stato interamente versato 10.000€ | RM – 1715269
P. IVA 17387741006 · Il capitale è stato interamente versato 10.000€ | RM – 1715269