Neubla Optimized Model Repository
Model categories
Vision
NLP
Multi-Modal
Compression techniques
Pruning: channel-pruning, 2:4 semi-structured, etc.
Quantization: W8A8, W4AF16, WF8AF8, etc.
Parameter-efficient finetuning: PEFT, etc.