Attention Is All You Need (lemmy.intai.tech)
arxiv.org/pdf/1706.03762.pdf...
Large Language Models as General Pattern Machines...
Title: ChatGPT an ENFJ, Bard an ISTJ: Empirical Study on Personalities of Large Language Models...
[Tweet](https://twitter.com/SourabhAgr03/status/1671884579203063812)...
Found here...
https://arxiv.org/pdf/2305.00118.pdf
Original LIMA Paper...
cross-posted from: https://sh.itjust.works/post/225391...
cross-posted from: https://sh.itjust.works/post/223997...
https://arxiv.org/pdf/2306.09479.pdf
https://twitter.com/_akhaliq/status/1670650476004556800 https://arxiv.org/pdf/2306.09557.pdf
https://arxiv.org/pdf/2306.09682.pdf
https://arxiv.org/pdf/2306.09896.pdf
https://arxiv.org/pdf/2306.09255.pdf
https://arxiv.org/pdf/2302.01308.pdf...
https://arxiv.org/pdf/2305.11206.pdf
https://arxiv.org/pdf/2306.08302.pdf
Paper Title: Orca: Progressive Learning from Complex Explanation Traces of GPT-4...
Title: From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces...
This post covers a proposed method, Fine-Grained Reinforcement Learning from Human Feedback (Fine-Grained RLHF), which improves language model training by using fine-grained human feedback. It shows how giving dense rewards after small text segments, each tied to a specific type of undesired behavior, can enable more effective...
https://lemmy.intai.tech/pictrs/image/2467a558-b2c9-48cb-b425-8c157cd6eef5.png