this post was submitted on 31 May 2024
25 points (96.3% liked)

AI

4155 readers
2 users here now

Artificial intelligence (AI) is intelligence demonstrated by machines, unlike the natural intelligence displayed by humans and animals, which involves consciousness and emotionality. The distinction between the former and the latter categories is often revealed by the acronym chosen.

founded 3 years ago
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 6 points 5 months ago

This article got me curious about how these 1-bit models worked so I read up on it a bit.

https://arxiv.org/html/2402.11295v3

The model parameters aren't completely converted to 1-bit. It's decomposed into a sign matrix (the 1-bit part) and two full precision vectors which together make a rank 1 approximation of the original matrix. So if I understand correctly, this means everything still functions the same way as a regular transformer. Input vectors, intermediate values, and outputs, all are full precision and have no problem going through nonlinearities.