Hey James, will you be sharing the notebook?
How can we put two sentences closer together in vector space? Can we use this approach?
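One common approach (a minimal sketch, assuming the sentence-transformers library; the model name and sentence pair are placeholders): fine-tune on the pair with a target cosine similarity of 1.0 so the two embeddings get pulled together.

```python
# Sketch: pull a pair of sentences together by fine-tuning with
# CosineSimilarityLoss and a target similarity of 1.0.
from torch.utils.data import DataLoader
from sentence_transformers import SentenceTransformer, InputExample, losses

model = SentenceTransformer('bert-base-nli-mean-tokens')

# Pairs you want close in vector space, labeled 1.0
train_examples = [
    InputExample(texts=['How old are you?', 'What is your age?'], label=1.0),
]
train_dataloader = DataLoader(train_examples, shuffle=True, batch_size=16)
train_loss = losses.CosineSimilarityLoss(model)

model.fit(train_objectives=[(train_dataloader, train_loss)], epochs=1)
```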
In the paper they talk about "trainable" weights for the softmax classification. Can you not access those weights after training from the SBERT model (e.g. to make predictions)? Or is cosine distance the only way to use the model?
So you don't update the weights of `ffnn`?
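For anyone wondering the same: a minimal sketch (assuming the sentence-transformers `SoftmaxLoss` is used) showing that the trainable head lives on the loss object rather than on the saved model, so it is still reachable after training. In practice, though, the SBERT paper discards this head at inference and uses cosine similarity on the embeddings instead.

```python
# Sketch (assumes sentence-transformers' SoftmaxLoss): the trainable softmax
# head is stored on the loss object, not on the saved SentenceTransformer,
# so keep a reference to the loss if you want those weights afterwards.
from sentence_transformers import SentenceTransformer, losses

model = SentenceTransformer('bert-base-uncased')
train_loss = losses.SoftmaxLoss(
    model=model,
    sentence_embedding_dimension=model.get_sentence_embedding_dimension(),
    num_labels=3,  # e.g. NLI: contradiction / entailment / neutral
)
# ... model.fit(train_objectives=[(train_dataloader, train_loss)], ...) ...

head = train_loss.classifier   # a torch.nn.Linear
print(head.weight.shape)       # (num_labels, 3 * embedding_dim) with the default flags
```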
@james Briggs: I'm not clear on this aspect: when training the PyTorch way, you perform the concatenation operations explicitly in your code, but when training with the Sentence Transformers framework we don't see any concatenation. Is it being handled automatically by the library?
Yes, that's right, the library handles it automatically.
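For reference, a sketch of the relevant knobs: `SoftmaxLoss` builds the (u, v, |u - v|) concatenation internally, controlled by these flags (shown with their defaults), so no explicit `torch.cat` appears in user code.

```python
# Sketch: SoftmaxLoss handles the concatenation from the SBERT paper itself.
from sentence_transformers import SentenceTransformer, losses

model = SentenceTransformer('bert-base-uncased')
train_loss = losses.SoftmaxLoss(
    model=model,
    sentence_embedding_dimension=model.get_sentence_embedding_dimension(),
    num_labels=3,
    concatenation_sent_rep=True,              # include u and v
    concatenation_sent_difference=True,       # include |u - v|
    concatenation_sent_multiplication=False,  # optionally also u * v
)
```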
@jamesbriggs Thank you so much for the clarification and the wonderful tutorial. One more follow-up question: in the PyTorch implementation you added an FFNN after the concatenated tensor, but for Sentence-BERT there is no dense layer after the pooling operation. Is my understanding correct?
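For context, a sketch of how the modules compose in sentence-transformers: by default the saved model is just a transformer plus pooling (the classification FFNN belongs to the training loss, as above), and a dense layer is an optional extra module.

```python
# Sketch: default SBERT composition is transformer + pooling, with no dense
# layer; one can be appended explicitly as an optional module.
from torch import nn
from sentence_transformers import SentenceTransformer, models

word_embedding_model = models.Transformer('bert-base-uncased')
pooling_model = models.Pooling(word_embedding_model.get_word_embedding_dimension())

# The common SBERT setup: no dense layer after pooling.
model = SentenceTransformer(modules=[word_embedding_model, pooling_model])

# Optional dense layer, if you want one:
dense_model = models.Dense(
    in_features=pooling_model.get_sentence_embedding_dimension(),
    out_features=256,
    activation_function=nn.Tanh(),
)
model_with_dense = SentenceTransformer(
    modules=[word_embedding_model, pooling_model, dense_model]
)
```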
Why don't you freeze the layers?
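(In case it helps anyone: freezing in PyTorch is just a `requires_grad` toggle. A minimal sketch below, not code from the video.)

```python
# Sketch: freeze the pretrained transformer so its weights are excluded
# from gradient updates; only layers added on top would then train.
from transformers import AutoModel

bert = AutoModel.from_pretrained('bert-base-uncased')
for param in bert.parameters():
    param.requires_grad = False  # the optimizer will skip these parameters
```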
Hello James,
I tried to fine-tune the model. I have a 3050 with 4 GB, and when I try to fit the model with a batch size of 16 I get a CUDA out-of-memory error. I ran your exact code from the video; the only difference is that my data is just 5,000 rows. Could you please advise how to solve this?
I had the same issue. I reduced the data size and then it went through. I am wondering how to fix the problem.
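Some common workarounds for OOM on a 4 GB card (a sketch that assumes the `model`, `train_examples`, and `train_loss` objects from the video's setup are already defined): shrink the batch, shorten the sequence length, and enable mixed precision.

```python
# Sketch: memory-saving knobs, assuming `model`, `train_examples`, and
# `train_loss` are already defined as in the video.
from torch.utils.data import DataLoader

model.max_seq_length = 128  # truncate long inputs to cut activation memory

train_dataloader = DataLoader(train_examples, shuffle=True, batch_size=4)  # down from 16
model.fit(
    train_objectives=[(train_dataloader, train_loss)],
    epochs=1,
    use_amp=True,  # fp16 mixed precision roughly halves activation memory
)
```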
Love this intro!
Can you make that title card into a PNG I can use for my background?
Another great video! Thanks James! I'm hoping my team can take your Udemy course soon!
That's awesome, it's included in Udemy for Business too :)
Take me on your team!