The Allen Institute for AI (AI2) has announced the release of Tülu 3, a state-of-the-art family of instruction-following models designed to set a new benchmark in AI capabilities. This release includes state-of-the-art features, methodologies, and tools, providing researchers and developers with a comprehensive, open-source solution. With Tülu 3, AI2 has successfully addressed a broad range of tasks, from conversational AI to complex problem-solving domains such as mathematics, reasoning, and evaluation. Tülu 3 is a model family prioritizing transparency, openness, and state-of-the-art performance. The models are based on Meta’s Llama 3.1 framework and have been fine-tuned on an extensive dataset mix comprising publicly available, synthetic, and human-created data. This approach ensures that Tülu 3 achieves excellence across diverse tasks, including specialized domains like MATH, GSM8K, and IFEval while maintaining strong capabilities in general-purpose chat and reasoning tasks. Read the full article here: www.marktechpost.com/2024/11/21/the-allen-institute-for-ai-ai2-releases-tulu-3-a-set-of-state-of-the-art-instruct-models-with-fully-open-data-eval-code-and-training-algorithms/ Tülu 3 8B (Llama-3.1-Tulu-3-8B): huggingface.co/allenai/Llama-3.1-Tulu-3-8B Tülu 3 70B (Llama-3.1-Tulu-3-70B): huggingface.co/allenai/Llama-3.1-Tulu-3-70B Audio Created by NotebookLLM and reviewed by real human 👉 Don’t Forget to join our 55k+ ML SubReddit: www.reddit.com/r/machinelearningnews/ ⚓ Feel free to subscribe to our AI Research Newsletter read by 30k+ AI and Data Professionals: airesearchinsights.com/subscribe
The Allen Institute for AI (AI2) has announced the release of Tülu 3, a state-of-the-art family of instruction-following models designed to set a new benchmark in AI capabilities. This release includes state-of-the-art features, methodologies, and tools, providing researchers and developers with a comprehensive, open-source solution. With Tülu 3, AI2 has successfully addressed a broad range of tasks, from conversational AI to complex problem-solving domains such as mathematics, reasoning, and evaluation.
Tülu 3 is a model family prioritizing transparency, openness, and state-of-the-art performance. The models are based on Meta’s Llama 3.1 framework and have been fine-tuned on an extensive dataset mix comprising publicly available, synthetic, and human-created data. This approach ensures that Tülu 3 achieves excellence across diverse tasks, including specialized domains like MATH, GSM8K, and IFEval while maintaining strong capabilities in general-purpose chat and reasoning tasks.
Read the full article here: www.marktechpost.com/2024/11/21/the-allen-institute-for-ai-ai2-releases-tulu-3-a-set-of-state-of-the-art-instruct-models-with-fully-open-data-eval-code-and-training-algorithms/
Tülu 3 8B (Llama-3.1-Tulu-3-8B): huggingface.co/allenai/Llama-3.1-Tulu-3-8B
Tülu 3 70B (Llama-3.1-Tulu-3-70B): huggingface.co/allenai/Llama-3.1-Tulu-3-70B
Audio Created by NotebookLLM and reviewed by real human
👉 Don’t Forget to join our 55k+ ML SubReddit: www.reddit.com/r/machinelearningnews/
⚓ Feel free to subscribe to our AI Research Newsletter read by 30k+ AI and Data Professionals: airesearchinsights.com/subscribe