why llama-3-8B is 8 billion parameters instead of 7?
- Published 20 Apr 2024
- llama-3 has ditched its tokenizer and has instead opted to use the same tokenizer as GPT-4 (tiktoken, created by OpenAI); it's even using the same first 100K-token vocabulary.
In this video Chris walks through why Meta has switched tokenizers and the implications for model size, the embeddings layer, and multilingual tokenization.
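The title question can be sketched with back-of-envelope arithmetic. The figures below are assumptions for illustration (Llama-2 vocabulary of 32,000, Llama-3 vocabulary of 128,256, hidden size of 4096, untied input embedding and output head), not numbers confirmed in the video:

```python
# Assumed figures: Llama-2 vocab 32,000; Llama-3 vocab 128,256;
# hidden size 4096; untied input embedding + output head.
hidden = 4096
vocab_llama2 = 32_000
vocab_llama3 = 128_256

def embedding_params(vocab, hidden, tied=False):
    """Parameters in the input embedding plus the output head, each vocab x hidden."""
    matrices = 1 if tied else 2
    return matrices * vocab * hidden

extra = embedding_params(vocab_llama3, hidden) - embedding_params(vocab_llama2, hidden)
print(f"{extra / 1e9:.2f}B extra parameters from the larger vocabulary")
```

Under these assumptions the vocabulary jump alone adds roughly 0.79B parameters, which accounts for a large share of the gap between a "7B" and an "8B" model even before any other architectural changes.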
He also runs his tokenizer benchmark and shows how it's more efficient in languages such as Japanese.
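One reason vocabulary coverage matters so much for Japanese: when a byte-level BPE has no Japanese merges, it falls back to raw UTF-8 bytes, and Japanese costs three bytes per character while ASCII English costs one. A self-contained toy illustration (not the benchmark from the video):

```python
# Toy illustration: a byte-level tokenizer with no Japanese merges pays at
# least ~3 tokens per character for Japanese, since UTF-8 encodes kana and
# kanji as 3 bytes each, while ASCII English stays at 1 byte per character.
def bytes_per_char(text):
    return len(text.encode("utf-8")) / len(text)

print(bytes_per_char("hello world"))    # 1.0
print(bytes_per_char("こんにちは世界"))  # 3.0
```

A vocabulary that contains learned Japanese merges can instead emit roughly one token per character (or better), which is the efficiency gain the benchmark measures.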
repos
------
github.com/chrishayuk/embeddings
github.com/chrishayuk/tokeniz...
Excellent demonstration, Chris, thanks for sharing!
Great stuff.. no-nonsense presentation style, clear and technical, as it should be 😅.. question: is there a reason why it's not better to have common English syllables in the vocabulary? I understand “lov” being there, but I can’t imagine that “el” is a very useful token as part of “Lovelace”.. intuitively, I would think that it should simply be tokenized as “love” and “lace”
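Whether a fragment like “el” ends up in the vocabulary is decided by pair frequency in the training corpus, not by linguistic intuition: BPE greedily merges whichever adjacent pair occurs most often. A minimal toy trainer (an illustrative sketch, not tiktoken's actual implementation) shows the mechanism:

```python
from collections import Counter

def bpe_merges(words, num_merges):
    """Learn BPE merges from a toy corpus by greedy pair frequency."""
    vocab = Counter(tuple(w) for w in words)  # word -> frequency, as symbol tuples
    merges = []
    for _ in range(num_merges):
        # Count every adjacent symbol pair, weighted by word frequency.
        pairs = Counter()
        for word, freq in vocab.items():
            for a, b in zip(word, word[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        best = max(pairs, key=pairs.get)  # most frequent pair wins the merge
        merges.append(best)
        # Apply the merge everywhere before counting again.
        new_vocab = Counter()
        for word, freq in vocab.items():
            merged, i = [], 0
            while i < len(word):
                if i + 1 < len(word) and (word[i], word[i + 1]) == best:
                    merged.append(word[i] + word[i + 1])
                    i += 2
                else:
                    merged.append(word[i])
                    i += 1
            new_vocab[tuple(merged)] += freq
        vocab = new_vocab
    return merges

# "el" wins the first merge here simply because it is the most frequent pair.
print(bpe_merges(["el", "el", "love"], 1))
```

So if “el” appears often enough across the corpus (in “element”, “level”, “model”, …), it earns a merge early, even if it looks unhelpful inside any one word like “Lovelace”.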
Ok, that is all very concrete! Awesome, thanks for this. This seems like a lot of quick wins that are easy to discover, or does it only seem that way in hindsight because you explain it so clearly? Anyway, it's all a bit new to me. Perhaps a country like Norway would be wise to run this with their own tokeniser? Or is that too simplistic thinking?
I'm super excited to see the `llama.cpp`, `llama2.c`, etc. category be implemented for llama3!
Agree
llama.cpp already supports Llama3
great video, thank you
Thank you, glad it was useful
What are your thoughts on including spaces in the tokenizer? I tried it once and the LLM was optimising to predict spaces, as those were easy wins for it. But I like the way tiktoken keeps the space attached to a word rather than making the space a token on its own...
I’m okay with it. If you watch my video on visualizing the embeddings layer, you’ll see that words with and without leading spaces are so closely correlated in the initial embeddings layer that it’s basically a non-issue. The cost, however, is the size of the vocabulary and therefore of the embeddings layer. On the other hand, not handling spaces separately makes the model much more efficient, so having a word with its leading space as a single token makes a lot of sense.
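The behaviour described here can be illustrated with a simplified stand-in for the GPT-style pre-tokenization split (the regex below is a toy approximation, not tiktoken's actual pattern): each word carries its leading space, so the space never becomes a token of its own.

```python
import re

# Toy approximation of GPT-style pre-tokenization: a word absorbs the
# single space before it, so " Lovelace" and "Lovelace" are different
# pieces and a lone space token never needs to be predicted.
PRETOKEN_RE = re.compile(r" ?\w+| ?[^\w\s]+|\s+")

def pretokenize(text):
    return PRETOKEN_RE.findall(text)

print(pretokenize("Ada Lovelace wrote programs."))
```

Each piece would then be split further by the BPE merges; the point is just that spaces travel with the word that follows them.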
Why is there some PyTorch in there? Do finetuned or merged versions need it?