EfficientNet

Overview

EfficientNet is a family of convolutional neural network architectures introduced to address the problem of scaling CNNs in a principled and systematic way. Rather than arbitrarily increasing network depth or width, EfficientNet proposes a compound scaling method that jointly scales depth, width, and input resolution using a fixed set of scaling coefficients.

This approach demonstrated that carefully balanced scaling can yield significantly better accuracy–efficiency tradeoffs than conventional scaling strategies, and it influenced subsequent CNN design, particularly for resource-constrained and production environments.

Architectural Characteristics

Compound scaling of depth, width, and input resolution
Use of Mobile Inverted Bottleneck (MBConv) blocks
Depthwise separable convolutions
Squeeze-and-Excitation (SE) modules
Progressive stage-wise scaling across the network
Family defined by a base model and scaled variants

Design Rationale

Prior CNN architectures typically scaled along a single dimension (e.g., making networks deeper), often leading to diminishing returns or inefficient use of parameters. EfficientNet was designed to demonstrate that model capacity should be increased in a balanced way across multiple dimensions.

The compound scaling rule enforces this balance, ensuring that increases in depth, width, and resolution reinforce each other rather than becoming bottlenecks. The use of MBConv blocks and SE modules further improves parameter efficiency and representational capacity.

Training Paradigm

Supervised training with cross-entropy loss
Extensive use of data augmentation and regularization
Standard gradient-based optimization
Typically trained on large-scale image datasets
Scaled variants share architectural structure but differ in capacity and resolution

Notable Variants

EfficientNet-B0 (base model)
EfficientNet-B1 through EfficientNet-B7

Benchmark Performance (Reference)

Historical reference results on ImageNet under standard training protocols:

Model	Dataset	Metric	Result
EfficientNet-B0	ImageNet	Top-1 Accuracy	~77%
EfficientNet-B4	ImageNet	Top-1 Accuracy	~83%
EfficientNet-B7	ImageNet	Top-1 Accuracy	~84%