kellogh, #opensource WizardLM2 8x22B exceeds performance of GPT4 in some benchmarks
- Apache 2.0 license 👍
- progressive learning instead of all-at-once training means it's less power-hungry and more data-efficient to train
- Co-Teaching and Self-Teaching are intriguing; I want to hear more
- from Microsoft #AI, I imagine GPT5 must be nigh, if they’re releasing competition for GPT4