About
Open-source fine-tuning & reinforcement learning for LLMs. π¦₯
Features
- 2x faster fine-tuning via optimized kernels
- Memory reduction up to 50% for 7B+ models
- Support for Llama, Mistral, Qwen, and more
- RLHF (reinforcement learning from human feedback) integration
- Automatic checkpointing and resume
Links
Categories
Reviews
5
Write a Review
Get new AI tools weekly
Subscribe to our newsletter and never miss a tool.
Related Tools
Get new AI tools weekly
Subscribe to our newsletter and never miss a tool.