[R] QuIP#: SOTA 2-bit LLMs
We're pleased to introduce QuIP#, a new SOTA LLM quantization method that combines incoherence processing (from the original QuIP paper) with lattice codebooks to achieve 2-bit LLMs with near-fp16 performance! Now you can run LLaMA 2 70B on a 24GB GPU without offloading!
QuIP# crushes all publicly available 2-bit PTQ methods on language modeling and zero-shot tasks while staying conceptually clean and simple. We've released quantized LLaMA, Mistral, and OpenHermes models, plus a full codebase at https://github.com/Cornell-RelaxML/quip-sharp
More information on how QuIP# works here https://cornell-relaxml.github.io/quip-sharp/
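For intuition, here's a toy NumPy sketch of the incoherence-processing idea: rotating a weight matrix by random orthogonal matrices spreads outlier weights across all entries, so a crude low-bit quantizer loses far less accuracy. This is only an illustration under simplifying assumptions (dense random rotations from QR instead of the fast structured transforms used in practice, and a plain uniform 2-bit quantizer instead of lattice codebooks); `random_orthogonal` and `quantize_2bit` are hypothetical helper names, not the QuIP# API.

```python
import numpy as np

def random_orthogonal(n, rng):
    # Random orthogonal matrix via QR (a stand-in for the fast structured
    # transforms real implementations use; hypothetical helper).
    q, r = np.linalg.qr(rng.standard_normal((n, n)))
    return q * np.sign(np.diag(r))  # fix signs so the distribution is uniform

def quantize_2bit(w):
    # Naive uniform 2-bit quantizer: 4 levels at {-1.5, -0.5, 0.5, 1.5} * scale.
    scale = np.abs(w).max() / 1.5
    q = np.clip(np.round(w / scale - 0.5), -2, 1) + 0.5
    return q * scale

rng = np.random.default_rng(0)
n = 64
W = rng.standard_normal((n, n))
W[0, 0] = 25.0  # a single outlier weight that wrecks naive quantization

U, V = random_orthogonal(n, rng), random_orthogonal(n, rng)
W_rot = U @ W @ V.T                       # incoherence processing: spread the outlier
W_hat = U.T @ quantize_2bit(W_rot) @ V    # quantize, then undo the rotations

naive_err = np.linalg.norm(quantize_2bit(W) - W)
rot_err = np.linalg.norm(W_hat - W)
print(rot_err < naive_err)
```

The outlier forces the naive quantizer's scale to be huge, so ordinary weights all collapse onto the same levels; after rotation, every entry is comparably sized and the same 2-bit grid fits much better.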