[Example] NdLinear + LoRA Fine-Tuning on SmallViT (MNIST) #588

aryanator · 2025-04-30T18:05:54Z

This notebook demonstrates an efficient fine-tuning strategy for vision transformers by combining:

- NdLinear: A compressed linear layer that uses tensor factorization to reduce parameter redundancy.
- LoRA (Low-Rank Adaptation): Lightweight fine-tuning via trainable low-rank matrices.
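
To make the two ideas concrete, here is a minimal PyTorch sketch. It is not the notebook's code and not the NdLinear library's API: `AxisFactorizedLinear` is a hypothetical stand-in that applies one small linear map per input axis, and `LoRALinear` is a generic low-rank adapter around a frozen `nn.Linear`. The shapes, `rank`, and `alpha` values are illustrative assumptions.

```python
import torch
import torch.nn as nn


class AxisFactorizedLinear(nn.Module):
    """Hypothetical stand-in (not the NdLinear API): one linear map per axis."""

    def __init__(self, in_dims, out_dims):
        super().__init__()
        self.maps = nn.ModuleList(
            nn.Linear(i, o) for i, o in zip(in_dims, out_dims)
        )

    def forward(self, x):
        # x has shape (batch, d1, d2, ...); transform one axis at a time
        # by moving it to the last position, applying the map, moving it back.
        for axis, layer in enumerate(self.maps, start=1):
            x = layer(x.transpose(axis, -1)).transpose(axis, -1)
        return x


class LoRALinear(nn.Module):
    """Generic LoRA adapter: frozen base layer plus a trainable low-rank update."""

    def __init__(self, base: nn.Linear, rank: int = 4, alpha: float = 8.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # only A and B are trained
        self.A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, rank))  # zero init
        self.scale = alpha / rank

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.t() @ self.B.t())


def n_params(module, trainable_only=False):
    return sum(
        p.numel() for p in module.parameters()
        if p.requires_grad or not trainable_only
    )


dense = nn.Linear(128, 128)                           # 128*128 + 128 weights
factored = AxisFactorizedLinear((8, 16), (8, 16))     # same 128 features as an 8x16 grid
dense_count = n_params(dense)                         # 16512
factored_count = n_params(factored)                   # 344
lora = LoRALinear(dense, rank=4)
lora_trainable = n_params(lora, trainable_only=True)  # 1024
```

Because `B` starts at zero, the adapted layer reproduces the base layer's output until training updates it, which is the usual reason for that initialization.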

The notebook includes:

- A wrapper (NdLinearAdapter) that replaces nn.Linear with NdLinear across ViT blocks.
- LoRA injection into NdLinear using pre-forward hooks.
- A training loop for three model variants:
  - Standard SmallViT
  - SmallViT with NdLinear
  - SmallViT with NdLinear + LoRA
- Visualization of loss, accuracy, and singular value distribution.
- A final comparison of parameter counts, file size, and accuracy.
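
The layer replacement and hook-based injection listed above could look roughly like the sketch below. It is an illustration under assumptions, not the notebook's NdLinearAdapter: `swap_linear`, `InputLoRA`, and `attach_lora` are hypothetical names, and the hook perturbs the layer input with a low-rank residual, which is only one way a pre-forward hook can carry a LoRA-style update. Only `nn.Module.named_children` and `register_forward_pre_hook` are actual PyTorch calls.

```python
import torch
import torch.nn as nn


def swap_linear(module: nn.Module, make_layer) -> int:
    """Recursively replace every nn.Linear child with make_layer(old_linear)."""
    swapped = 0
    for name, child in list(module.named_children()):
        if isinstance(child, nn.Linear):
            setattr(module, name, make_layer(child))
            swapped += 1
        else:
            swapped += swap_linear(child, make_layer)
    return swapped


class InputLoRA(nn.Module):
    """Low-rank residual on a layer's input: x + scale * (x A^T) B^T."""

    def __init__(self, features: int, rank: int = 4, alpha: float = 8.0):
        super().__init__()
        self.A = nn.Parameter(torch.randn(rank, features) * 0.01)
        self.B = nn.Parameter(torch.zeros(features, rank))  # zero init: no change at start
        self.scale = alpha / rank

    def forward(self, x):
        return x + self.scale * (x @ self.A.t() @ self.B.t())


def attach_lora(layer: nn.Module, features: int, rank: int = 4):
    """Register the adapter on `layer` and apply it in a forward pre-hook."""
    layer.add_module("lora", InputLoRA(features, rank))  # A and B become parameters of layer

    def pre_hook(mod, args):
        # args is the tuple of positional inputs; return the replacement tuple.
        return (mod.lora(args[0]),) + tuple(args[1:])

    return layer.register_forward_pre_hook(pre_hook)


# Toy MLP standing in for a ViT block (assumption, not the SmallViT definition).
mlp = nn.Sequential(nn.Linear(16, 32), nn.GELU(), nn.Linear(32, 16))
x = torch.randn(2, 16)
before = mlp(x)
handle = attach_lora(mlp[0], features=16)
after = mlp(x)        # identical to `before` while B is still zero
handle.remove()       # detaches the hook again

# Swap both linear layers for bias-free copies as a stand-in replacement.
fresh = nn.Sequential(nn.Linear(16, 32), nn.GELU(), nn.Linear(32, 16))
num_swapped = swap_linear(
    fresh, lambda old: nn.Linear(old.in_features, old.out_features, bias=False)
)
```

In a real adapter the `make_layer` callback would build the factorized layer from the old layer's dimensions; how the notebook maps `in_features` and `out_features` onto NdLinear's dimensions is not stated here.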

Model	Parameters	Accuracy	Model Size
Standard ViT	5.52M	95.01%	22.11 MB
NdLinear ViT	5.52M	95.81%	22.11 MB
NdLinear + LoRA	5.67M	94.86%	22.70 MB
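
As a quick sanity check on the table, the reported sizes are close to what float32 storage would predict. The sketch below assumes 4 bytes per parameter and 1 MB = 10^6 bytes; neither is stated in the PR, and the helper name is illustrative.

```python
BYTES_PER_FLOAT32 = 4  # assumption: weights are stored as float32


def estimated_size_mb(params_millions: float) -> float:
    """Raw weight storage in MB (10^6 bytes), ignoring file-format overhead."""
    return params_millions * BYTES_PER_FLOAT32


# (parameter count in millions, reported size in MB) taken from the table above
reported = {
    "Standard ViT": (5.52, 22.11),
    "NdLinear ViT": (5.52, 22.11),
    "NdLinear + LoRA": (5.67, 22.70),
}

estimates = {name: estimated_size_mb(p) for name, (p, _) in reported.items()}
gaps = {name: reported[name][1] - estimates[name] for name in reported}

for name in reported:
    print(f"{name}: estimated {estimates[name]:.2f} MB, reported {reported[name][1]:.2f} MB")

# Extra parameters introduced by the LoRA variant, relative to the 5.52M baseline.
lora_extra_millions = 5.67 - 5.52
lora_extra_fraction = lora_extra_millions / 5.52
```

The estimates come out at about 22.08 MB and 22.68 MB, within a few hundredths of a megabyte of the reported values. The small gap would be consistent with rounding of the parameter counts plus serialization overhead, though the PR does not confirm this.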

This notebook is intended as a research-style educational example for the examples/ directory.

review-notebook-app · 2025-04-30T18:05:59Z

Check out this pull request on ReviewNB.

See visual diffs & provide feedback on Jupyter Notebooks.

Add files via upload

e996467
