#DeepLearning - Posts

ramikrispin, 9 days ago to ArtificialIntelligence

(1/2) Congratulations to my friend Lior and his co-author Meysam for the release of their new book - Mastering NLP from Foundations to LLMs 🎉

I met Lior a few years ago at a conference, and since then, I have been following his work in the field of NLP ❤️.

#nlp #python #machinelearning #deeplearning #DataScience #LLM

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 9 days ago

(2/2) The book covers the following topics:
✅ Mathematical foundations of machine learning and NLP
✅ Data preprocessing techniques for text data
✅ Machine learning applications for NLP and text classification
✅ Deep learning methods for NLP and text applications
✅ Theory and design of Large Language Models
✅ Applications of LLM models
✅ LLM applications with Langchain

The book is for folks who are interested in getting started with NLP and those who wish to delve into LLM applications.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

telescoper.blog, 12 days ago to ai

Before I head off on a trip to various parts of not-Barcelona, I thought I’d share a somewhat provocative paper by David Hogg and Soledad Villar. In my capacity as journal editor over the past few years I’ve noticed that there has been a phenomenal increase in astrophysics papers discussing applications of various forms of Machine Leaning (ML). This paper looks into issues around the use of ML not just in astrophysics but elsewhere in the natural sciences.

The abstract reads:

Machine learning (ML) methods are having a huge impact across all of the sciences. However, ML has a strong ontology – in which only the data exist – and a strong epistemology – in which a model is considered good if it performs well on held-out training data. These philosophies are in strong conflict with both standard practices and key philosophies in the natural sciences. Here, we identify some locations for ML in the natural sciences at which the ontology and epistemology are valuable. For example, when an expressive machine learning model is used in a causal inference to represent the effects of confounders, such as foregrounds, backgrounds, or instrument calibration parameters, the model capacity and loose philosophy of ML can make the results more trustworthy. We also show that there are contexts in which the introduction of ML introduces strong, unwanted statistical biases. For one, when ML models are used to emulate physical (or first-principles) simulations, they introduce strong confirmation biases. For another, when expressive regressions are used to label datasets, those labels cannot be used in downstream joint or ensemble analyses without taking on uncontrolled biases. The question in the title is being asked of all of the natural sciences; that is, we are calling on the scientific communities to take a step back and consider the role and value of ML in their fields; the (partial) answers we give here come from the particular perspective of physics

arXiv:2405.18095

P.S. The answer to the question posed in the title is probably “yes”.

https://telescoper.blog/2024/05/30/is-machine-learning-good-or-bad-for-the-natural-sciences/

#AI #ArtificialIntelligence #arXiv240518095 #Astrophysics #Cosmology #DataScience #deepLearning #MachineLearning

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ ubi

metin, 12 days ago to ai

𝘝𝘦𝘳𝘺 𝘍𝘦𝘸 𝘗𝘦𝘰𝘱𝘭𝘦 𝘈𝘳𝘦 𝘜𝘴𝘪𝘯𝘨 '𝘔𝘶𝘤𝘩 𝘏𝘺𝘱𝘦𝘥' 𝘈𝘐 𝘗𝘳𝘰𝘥𝘶𝘤𝘵𝘴 𝘓𝘪𝘬𝘦 𝘊𝘩𝘢𝘵𝘎𝘗𝘛, 𝘚𝘶𝘳𝘷𝘦𝘺 𝘍𝘪𝘯𝘥𝘴

https://slashdot.org/story/24/05/30/0238230/very-few-people-are-using-much-hyped-ai-products-like-chatgpt-survey-finds

#AI #ArtificialIntelligence #ML #MachineLearning #DeepLearning #LLM #LLMs #OpenAI #news #tech #technology

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

daniel, 12 days ago

@metin That's interesting because in my circle (tech-savvy nerds and researchers) a lot of people use and recommend the use of ChatGPT. For example, the tutor of a scientific containerization course I attended last week used ChatGPT extensively to solve some very specific problems. Of course, you could get the same results using search engines, but an AI is much faster in these cases and can at least point you in the right direction.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

metin, 12 days ago

@daniel Yes, I think it's a matter of time before AI will be widely used. Personally, I barely use ChatGPT, because I don't trust the output yet, due to the hallucinations. I'm waiting until that has been solved. But I know that it's already usable for exact purposes like coding.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

lampinen, 19 days ago to ArtificialIntelligence

How well can we understand an LLM by interpreting its representations? What can we learn by comparing brain and model representations? Our new paper (https://arxiv.org/abs/2405.05847) highlights intriguing biases in learned feature representations that make interpreting them more challenging! 1/9
#intrepretability #deeplearning #representation #transformers

reply

expand (11)

collapse (11)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

lampinen, 19 days ago

For example, if we train a model to compute a simple, linear feature and a hard, highly non-linear one, the easy feature is naturally learned first, but both are generalized perfectly by the end of training. However, the easy feature dominates the representations! 3/9

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

lampinen, 19 days ago

This paper is really just us finally following up on a weird finding about RSA (figure on the here) from a paper Katherine Hermann & I had at NeurIPS back in the dark ages (2020): https://x.com/khermann_/status/1323353860283326464
Thanks to my coauthors @scychan_brains & Katherine! 9/9

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

ramikrispin, 20 days ago to llm

Fine Tuning LLM Models – Generative AI Course 👇🏼

FreeCodeCamp released today a new course for fine tuning LLM models. The course, by Krish Naik, focuses on different tuning methods such as QLORA, LORA, and Quantization using different models such as Llama2, Gradient, and Google Gemma model.

📽️: https://www.youtube.com/watch?v=iOdFUJiB0Zc

#llm #DataScience #MachineLearning #genai #deeplearning

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

HxxxKxxx, 20 days ago to ArtificialIntelligence German

Vom 16.9.-19.9.2024 richten wir an der Universität zu Köln wieder eine Sommerschule zum Thema
"Deep Learning for Language Analysis“ aus,

Weitere Informationen: http://ml-school.uni-koeln.de/

#DeepLearning #LanguageAnalysis #SummerSchool #UniKöln #MLSchool

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ CerstinMahlow

SIB, 21 days ago to ArtificialIntelligence

“The Protein Universe Atlas is a groundbreaking resource for exploring the diversity of proteins. Its user-friendly web interface empowers researchers, biocurators and, students in navigating the “dark matter” to explore proteins of unknown function.”

🥁 That’s what the committee said about this work, one of the #SIBRemarkableOutputs 2023 👏

👉 Find out more about this and the other outputs: https://tinyurl.com/ye2yrpxx

#deeplearning #proteins

video/mp4

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ wordshaper

koen, 21 days ago to ArtificialIntelligence

Paul Gerke presents on #deeplearning infrastructure for #medical #image #analysis at @nluug #nluug #vj2024

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

metin, 22 days ago (edited 22 days ago) to ai

So… Big Tech is allowed to blatantly steal the work, styles and therewith the job opportunities of thousands of artists and writers without being reprimanded, but it takes similarity to the voice of a famous actor to spark public outrage about AI. 🤔

https://www.theregister.com/2024/05/21/scarlett_johansson_openai_accusation/

#AI #ArtificalIntelligence #ML #MachineLearning #DeepLearning #LLM #LLMs #OpenAI #SamAltman

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ wonziu

rubinjoni, 22 days ago

@metin Better late, than never.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

metin, 22 days ago

@rubinjoni Definitely. 👍

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 23 days ago to machinelearning

MLX Examples 🚀

The MLX is Apple's framework for machine learning applications on Apple silicon. The MLX examples repository provides a set of examples for using the MLX framework. This includes examples of:
✅ Text models such as transformer, Llama, Mistral, and Phi-2 models
✅ Image models such as Stable Diffusion
✅ Audio and speech recognition with OpenAI's Whisper
✅ Support for some Hugging Face models

🔗 https://github.com/ml-explore/mlx-examples

#MachineLearning #llm #deeplearning #DataScience #Python

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Lobrien, 23 days ago

@ramikrispin @BenjaminHan How do this and corenet (https://github.com/apple/corenet) fit together? The corenet repo has examples for inference with MLX for models trained with corenet; is that it, does MLX not have, e.g., activation and loss fns, optimizers, etc.?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 23 days ago

@Lobrien @BenjaminHan The corenet is deep learning application where the MLX is array framework for high performance on Apple silicon. This mean that if you are using mac with M1-3 CPU it should perform better when using MLX on the backend (did not test it myself)

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

metin, 25 days ago (edited 25 days ago) to ai

Interesting article, discussing the status of OpenAI and the recent resignations of Ilya Sutskever and Jan Leike…

https://www.vox.com/future-perfect/2024/5/17/24158478/openai-departures-sam-altman-employees-chatgpt-release

#AI #ArtificialIntelligence #ML #MachineLearning #DeepLearning #LLM #LLMs #tech #technology #OpenAI #SamAltman #BigTech #article #read #reading #news

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

metin, 26 days ago to ai

AI… There's no escaping the scraping.

#AI #ArtificialIntelligence #ML #MachineLearning #DeepLearning #LLM #LLMs #tech #technology #BigTech #Google #Microsoft #OpenAI #Meta #Apple

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 26 days ago to ArtificialIntelligence

(1/2) MIT Introduction to Deep Learning 🚀🚀🚀

MIT launched the 2024 edition of the Introduction to Deep Learning course by Prof. Alexander Amini and Prof.Ava Amini. The course started at the end of April and will run until June. The course lectures are published weekly. The course syllabus keeps changing from year to year, reflecting the rapid changes in this field.

#deeplearning #MachineLearning #DataScience #AI #genai #python

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 26 days ago

(2/2) The course covers the following topics:
✅ Deep learning foundation
✅ Computer vision
✅ Deep generative modeling
✅ Reinforcement learning
✅ Robot learning
✅ Text to image

Resources 📚
Course website 🔗: http://introtodeeplearning.com/
Video lectures 📽️: https://www.youtube.com/playlist?list=PLtBw6njQRU-rwp5__7C0oIVt26ZgjG9NI

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 28 days ago to datascience

(1/2) Happy Tuesday! ☀️

Deep Generative Models - New Stanford Course 🚀👇🏼

Stanford University released a new course last week focusing on Deep Generative Models. The course, by Prof. Stefano Ermon, focuses on the models beyond GenAI models.

#genai #DataScience #MachineLearning #deeplearning

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

metin, 29 days ago to ai

Modern times summarized in one headline...

This AI software is helping schools detect guns

https://www.fastcompany.com/91123682/schools-detect-guns-ai-software-zeroeyes-kansas

#AI #ArtificialIntelligence #ML #MachineLearning #DeepLearning #LLM #LLMs #guns #school #schools #software

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...