Official implementation of the following work:
P. Zhang, Z. Mai, Q.-H. Nguyen, and W.-L. Chao. Revisiting semi-supervised learning in the era of foundation models. arXiv preprint arXiv:2503.09707, 2025.
This repository contains the official implementation of V-PET (VFM-PEFT Ensemble Training). The code is modified from USB. Original copyright notice:
Copyright (c) 2021 Othneil Drew
We recommend using Conda to create a Python 3.9 environment:

```bash
conda create -n vpet python=3.9
conda activate vpet
```
Then install the required packages:
```bash
pip install -r requirements.txt
```
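To quickly confirm the environment works, a minimal sanity check (assuming PyTorch is among the packages listed in `requirements.txt`):

```bash
# Optional check: verify that PyTorch imports and reports its version.
python -c "import torch; print(torch.__version__)"
```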
Please download our models and data, then unzip the files and place them in the `data` and `pretrain_weight` directories with the following structure:
```
├── data
│   └── vtab
│       ├── clevr_count
│       │   ├── images
│       │   │   ├── 000
│       │   │   ├── 001
│       │   │   └── ...
│       │   ├── labeled_idx
│       │   ├── test.list
│       │   ├── train.list
│       │   ├── trainval.list
│       │   └── val.list
│       ├── diabetic_retinopathy
│       │   └── ...
│       ├── dtd
│       │   └── ...
│       ├── kitti
│       │   └── ...
│       ├── resisc45
│       │   └── ...
│       └── sun397
│           └── ...
├── pretrain_weight
│   ├── vit_base_patch14_reg4_dinov2_lvd142m.bin
│   └── vit_base_patch16_clip_224_openai.bin
└── [other files]
```
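For example, assuming the downloads arrive as `data.zip` and `pretrain_weight.zip` (hypothetical archive names; substitute the actual filenames), you can unpack them from the repository root:

```bash
# Hypothetical archive names -- replace with the actual download filenames.
unzip data.zip -d .
unzip pretrain_weight.zip -d .
```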
Because V-PET is an ensemble method that requires hyperparameter tuning before training, our workflow is implemented as a three-step process:
- Train: Train the model on labeled data.
- Tune: Based on the trained model, tune the hyperparameters on the validation set.
- V-PET: Run V-PET on the pseudo-labels generated by the tuned model.
All the training commands are in the `scripts/` folder. To run all the scripts, we can simply run `run_train.sh` in the root directory, or we can run the intended commands independently. Note that to run V-PET without the other SSL baselines, we only need to run the commands in `scripts/clip/run_supervised.sh` and `scripts/dinov2/run_supervised.sh` to train the labeled-only models.
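For instance (a sketch that assumes the scripts are run from the repository root):

```bash
# Run everything in scripts/ ...
bash run_train.sh
# ... or only the labeled-only models needed for V-PET:
bash scripts/clip/run_supervised.sh
bash scripts/dinov2/run_supervised.sh
```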
After training the models, we can generate the hyperparameter-tuning metrics by running the following commands:
```bash
# Generate the list of models for which we need hyperparameter-tuning metrics.
# The list will be saved in `eval_list.pkl`.
python eval_gen_list.py

# Read `eval_list.pkl` and generate the metrics for each model in the list.
# Results will be saved in each log folder.
python eval.py
```
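If something looks off, a quick way to inspect the generated model list (a minimal sketch, assuming `eval_list.pkl` holds a standard Python list):

```bash
# Hypothetical sanity check: count the entries queued for evaluation.
python -c "import pickle; print(len(pickle.load(open('eval_list.pkl', 'rb'))))"
```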
Then, we can collect the metrics with SQLite by running the following command:
```bash
python tune_collect.py
```
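To peek at the collected metrics directly, you can open the database with the SQLite CLI (the filename below is a hypothetical placeholder; check `tune_collect.py` for the actual path):

```bash
# `results.db` is a hypothetical name -- use the path tune_collect.py actually writes.
sqlite3 results.db ".tables"
```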
After generating the hyperparameter tuning metrics, we can run the following command to generate config files for V-PET:
```bash
python gen_config_pet.py
```
Then we can run V-PET with the generated config files, for example:
```bash
python train.py --c "config/lora/pet-ensemble/dtd/3-shot/clip/config.yaml"
```
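To launch V-PET for every generated config at once, a sketch that assumes all configs follow the directory layout of the example above:

```bash
# Assumes gen_config_pet.py writes configs under
# config/lora/pet-ensemble/<dataset>/<shots>/<backbone>/config.yaml.
for cfg in config/lora/pet-ensemble/*/*/*/config.yaml; do
    python train.py --c "$cfg"
done
```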
Similarly, we can collect the results using SQLite:
```bash
python tune_collect.py
```
To read the tuned results from SQLite, we can run the following command:
```bash
python tune_print.py
```
If you find our work useful, please cite:

```bibtex
@misc{zhang2025revisitingsemisupervisedlearningera,
  title={Revisiting semi-supervised learning in the era of foundation models},
  author={Ping Zhang and Zheda Mai and Quang-Huy Nguyen and Wei-Lun Chao},
  year={2025},
  eprint={2503.09707},
  archivePrefix={arXiv},
  primaryClass={cs.LG},
  url={https://arxiv.org/abs/2503.09707},
}
```