- 126
- 74 409
dataroots
Belgium
Приєднався 7 кві 2020
The idea of dataroots research (dataroots.io/research/) is to establish a community to focus on applied AI research and to share knowledge and experiences on data & AI. We work in collaboration with academic institutions, government agencies and private organisations to further explore the possibilities of AI and investigate how solutions can be applied in different business models. It is a place to foster academic output in an applied research setting to create SOTA prototypes about AI solutions. In dataroots research, we welcome students from academic institutions to do internships and collaborate on their thesis subjects related to data & AI. We also support and train people in achieving more with Data & AI and grow their expertise.
We organize different events:
• meetup@lunch: Easy bite webinars covering a wide range of topics on applied AI.
• rootcamps: Open sessions to involve and introduce community
• rootfood: Informal breakfast bringing together players in AI.
We organize different events:
• meetup@lunch: Easy bite webinars covering a wide range of topics on applied AI.
• rootcamps: Open sessions to involve and introduce community
• rootfood: Informal breakfast bringing together players in AI.
Hello 2025! OpenAI’s O3, Deep Seek V3, Bolt.new and Doom Goes Artsy | DataTopics Unplugged #74
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society.
Dive into conversations that flow as smoothly as your morning coffee (but don't), where industry insights meet laid-back banter. Whether you're a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let's get into the heart of data, unplugged style!
In this episode, we explore:
OpenAI’s O3: Features, O1 Comparison, Release Date & more.
www.datacamp.com/blog/o3-openai
Advent of Code: How LLMs performed on the 2024 coding challenges.
www.jerpint.io/blog/advent-of-code-llms/
DeepSeek V3: A breakthrough AI model developed for a fraction of GPT-4’s cost, yet rivaling top benchmarks.
www.deepseek.com
Shadow Workspace: How Cursor compares to Copilot with features like integrated models, documentation, and search.
www.cursor.com/blog/shadow-workspace
Bolt.new: Why it’s poised to revolutionize web app development with prompt-driven innovation.
bolt.new
O1 Preview’s Chess Hack: When smarter means “cheater” in a fascinating experiment against Stockfish.
www.reddit.com/r/singularity/comments/1hodklk/more_scheming_detected_o1preview_autonomously/?rdt=54955
Pydantic AI: A new tool bringing structure and intelligence to Python’s AI workflows.
github.com/pydantic/pydantic-ai
RightTyper: A tool to infer and apply type hints for cleaner, more efficient Python code.
github.com/RightTyper/RightTyper
Doom: The Gallery Experience: A whimsical take on art appreciation in a retro gaming environment.
bobatealee.itch.io/doom-the-gallery-experience
Suno V4: The next-gen music generator, featuring "Bart, the Data Dynamo."
suno.com/blog/v4
Ghostty Terminal: The terminal emulator developers are raving about.
ghostty.org
#podcast #techpodcast #ai #AI #AI #ArtificialIntelligence #OpenAI #DeepSeek #BoltNew #AdventOfCode #O3Model #AIRevolution #MachineLearning #DataScience #Coding #SoftwareDevelopment #PythonProgramming #PydanticAI #TechNews #ShadowWorkspace #LLM #TechInnovation #WebDevelopment #GPTModels #AIModels #FutureOfAI #AIApplications #DataEngineering #AIandCoding #AIinTech #GenerativeAI #TechPodcast #ProgrammingTools #AIInnovation #BigData #DeepLearning #AI2025 #CodingChallenges #AIResearch #AIModelsExplained #PythonTools #AIandData #DataDriven #OpenSourceTools #FutureTech #WebApps #AITrends #AIInAction #PythonCoding #AIExploration #SunoV4 #GhosttyTerminal #TechTools #AICommunity #AIInsights
00:00:02 OpenAI Releases O3 and Neural Networks
00:08:27 Advancements in AI Performance and Cost
00:17:00 Developing AI-enhanced IDEs and Web Apps
00:28:53 AI-Enhanced Web Apps and Development
00:41:3 Python Type Systems and Tooling
00:49:16 Python Type System and RightTyper
01:01:24 AI in Music and Terminal Emulators
Dive into conversations that flow as smoothly as your morning coffee (but don't), where industry insights meet laid-back banter. Whether you're a data aficionado or just someone curious about the digital age, pull up a chair, relax, and let's get into the heart of data, unplugged style!
In this episode, we explore:
OpenAI’s O3: Features, O1 Comparison, Release Date & more.
www.datacamp.com/blog/o3-openai
Advent of Code: How LLMs performed on the 2024 coding challenges.
www.jerpint.io/blog/advent-of-code-llms/
DeepSeek V3: A breakthrough AI model developed for a fraction of GPT-4’s cost, yet rivaling top benchmarks.
www.deepseek.com
Shadow Workspace: How Cursor compares to Copilot with features like integrated models, documentation, and search.
www.cursor.com/blog/shadow-workspace
Bolt.new: Why it’s poised to revolutionize web app development with prompt-driven innovation.
bolt.new
O1 Preview’s Chess Hack: When smarter means “cheater” in a fascinating experiment against Stockfish.
www.reddit.com/r/singularity/comments/1hodklk/more_scheming_detected_o1preview_autonomously/?rdt=54955
Pydantic AI: A new tool bringing structure and intelligence to Python’s AI workflows.
github.com/pydantic/pydantic-ai
RightTyper: A tool to infer and apply type hints for cleaner, more efficient Python code.
github.com/RightTyper/RightTyper
Doom: The Gallery Experience: A whimsical take on art appreciation in a retro gaming environment.
bobatealee.itch.io/doom-the-gallery-experience
Suno V4: The next-gen music generator, featuring "Bart, the Data Dynamo."
suno.com/blog/v4
Ghostty Terminal: The terminal emulator developers are raving about.
ghostty.org
#podcast #techpodcast #ai #AI #AI #ArtificialIntelligence #OpenAI #DeepSeek #BoltNew #AdventOfCode #O3Model #AIRevolution #MachineLearning #DataScience #Coding #SoftwareDevelopment #PythonProgramming #PydanticAI #TechNews #ShadowWorkspace #LLM #TechInnovation #WebDevelopment #GPTModels #AIModels #FutureOfAI #AIApplications #DataEngineering #AIandCoding #AIinTech #GenerativeAI #TechPodcast #ProgrammingTools #AIInnovation #BigData #DeepLearning #AI2025 #CodingChallenges #AIResearch #AIModelsExplained #PythonTools #AIandData #DataDriven #OpenSourceTools #FutureTech #WebApps #AITrends #AIInAction #PythonCoding #AIExploration #SunoV4 #GhosttyTerminal #TechTools #AICommunity #AIInsights
00:00:02 OpenAI Releases O3 and Neural Networks
00:08:27 Advancements in AI Performance and Cost
00:17:00 Developing AI-enhanced IDEs and Web Apps
00:28:53 AI-Enhanced Web Apps and Development
00:41:3 Python Type Systems and Tooling
00:49:16 Python Type System and RightTyper
01:01:24 AI in Music and Terminal Emulators
Переглядів: 21
Відео
Generative AI: Hype vs. Reality - Insights from RootsConf | DataTopics Unplugged #71
Переглядів 298Місяць тому
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society. This week, we’re bringing you a special episode straight from RootsConf, our annual internal knowledge-sharing extravaganza! Hosts Murilo and Bart sit down with Tim and Ben, data strategy experts, fo...
What's Next for AI? A Recap of 2024 and Predictions for 2025
Переглядів 71Місяць тому
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society. This week, Yannick joins the conversation for a lively year-end retrospective on the state of AI, data, and technology in 2024. Whether you're knee-deep in neural networks or just data-curious, this ...
From Engineer to CEO: Alex Gallego on Building Red Panda | DataTopics Unplugged #69
Переглядів 63Місяць тому
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society. In this episode, we’re joined by a special guest: Alex Gallego, founder and CEO of Red Panda. Together, we dive deep into building data-intensive applications, the evolution of streaming technologies...
What Happens When GenAI Meets Minecraft? Plus, OpenAI’s O1 Leak & more | DataTopics Unplugged #68
Переглядів 462 місяці тому
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society. In this episode, we are joined by special guest Nico for a lively and wide-ranging tech chat. Grab your headphones and prepare for: Strava’s ‘Athlete Intelligence’ feature: A humorous dive into how w...
The AI Race: ChatGPT's New Web Search, Meta’s AI, & Python 3.13's Upgrade| DataTopics Unplugged #67
Переглядів 922 місяці тому
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society. Dive into conversations that should flow as smoothly as your morning coffee (but don't), where industry insights meet laid-back banter. Whether you're a data aficionado or just someone curious about ...
From Will Smith to MovieGen: How AI Videos Got So Real | DataTopics Unplugged #66
Переглядів 512 місяці тому
Welcome to Datatopics Unplugged, where the tech world’s buzz meets laid-back banter. In each episode, we dive into the latest in AI, data science, and technology-perfect for your inner geek or curious mind. Pull up a seat, tune in, and join us for insights, laughs, and the occasional hot take on the digital world. Meta’s video generation breakthrough: Explore Meta’s new “MovieGen” model family ...
The Art of Data Storytelling: A Deep Dive with Angelica Lo Duca | DataTopics Unplugged #65
Переглядів 1042 місяці тому
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society. In this episode, we dive into the world of data storytelling with special guest Angelica Lo Duca, a professor, researcher, and author. Pull up a chair as we explore her journey from programming to te...
Python WTF moments, Rust rants & Quantum flops | DataTopics Unplugged #64
Переглядів 832 місяці тому
Ever wondered if type hints in Python truly prevent coding errors, or if Rust's performance claims justify a complete rewrite of your projects? This episode delves into the technical intricacies of programming, offering insights into Python's type checkers and the cognitive challenges posed by overloading. We compare the quirks of Python and JavaScript, highlighted by the infamous "WTF Python" ...
What’s Next for Open Source? Astral’s business model, WordPress, Deno 2.0 & One Year of DataTopics!
Переглядів 1013 місяці тому
Ever wondered what keeps the world of open-source ticking amidst commercial interests? Get ready to unravel the gripping dynamics of open-source governance, with WordPress controversies and the ElasticSearch versus AWS saga leading the charge. Join us as we spotlight the tension between free ideals and profit-driven ventures, with a keen eye on the impact of for-profit entities like Astral on t...
The End of Pandas, Rise of Ibis: AI, Function Calling, & New Tools | DataTopics Unplugged #62
Переглядів 1073 місяці тому
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. Datatopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society. We dive into conversations smoother than your morning coffee (but let’s be honest, just as caffeinated) where industry insights meet light-hearted banter. Whether you’re a data wizard or just curious...
First Look at OpenAI O1 'Strawberry': AI is Officially Smarter Than Humans
Переглядів 1013 місяці тому
Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. DataTopics Unplugged is your go-to spot for laid-back banter about the latest in tech, AI, and coding. In this episode, Jonas joins us with fresh takes on AI smarts, sneaky coding tips, and a spicy CI debate: OpenAI's GPT-01 ("Strawberry"): The team explores OpenAI’s newest model, its advanced reason...
My First Year as a Data & Cloud Engineer: What It’s Like Working at Dataroots | Careers@dataroots
Переглядів 1154 місяці тому
Find out what it's like to be a Data & Cloud Engineer at Dataroots through the eyes of Amal, who shares her experiences from her first year working at Dataroots. From attending the Roots Academy, to landing her first client in the financial sector, she shares her journey to becoming a Data & Cloud engineer, and what she values most in her role. Whether you're considering a career in data engine...
AI & the Paris 2024 Olympics: From Tech to Yusuf Dikec Memes | DataTopics Unplugged #60
Переглядів 2625 місяців тому
Get ready for an exciting episode where we dive into the world of AI at the Olympics! From groundbreaking tech innovations at Paris 2024 to the funniest Yusuf Dikec memes, we'll explore how AI is transforming sports, judging gymnastics, and even boosting sprinters like Zoe Hobbs. Plus, discover how detailed stats and bird’s eye views are giving us stunning insights into the 100m final. Don’t mi...
Did AI Accurately Predict the Euro 2024 Winners? (PART 2) | DataTopics Unplugged #59
Переглядів 1335 місяців тому
In this episode, we dive deep into the fascinating world of AI predictions in sports, with a special focus on the Euro 2024 final between Spain and England. Join us as we explore: AI Predictions Revisited: Reflecting on the previous episode about AI predictions and their accuracy, particularly Snowflake's prediction for Euro 2024. medium.com/snowflake/predicting-euro-2024-with-snowflake-ml-9b7c...
Maximizing Productivity: Bookmarklets, Q Command-Line, RouteLLM, and DuckDB Extensions
Переглядів 1226 місяців тому
Maximizing Productivity: Bookmarklets, Q Command-Line, RouteLLM, and DuckDB Extensions
Can the Music Industry Win the Battle Against AI? | DataTopics Unplugged #57
Переглядів 1276 місяців тому
Can the Music Industry Win the Battle Against AI? | DataTopics Unplugged #57
What Skills Do You Need to Become an AI Engineer? | DataTopics Unplugged #56
Переглядів 1166 місяців тому
What Skills Do You Need to Become an AI Engineer? | DataTopics Unplugged #56
Can AI Predict EURO 2024 Winners? | DataTopics Unplugged Podcast #55
Переглядів 2946 місяців тому
Can AI Predict EURO 2024 Winners? | DataTopics Unplugged Podcast #55
Is Apple Intelligence...Intelligent? & More Tech News | DataTopics Unplugged #54
Переглядів 957 місяців тому
Is Apple Intelligence...Intelligent? & More Tech News | DataTopics Unplugged #54
Can AI Replace Human Creativity? | DataTopics Unplugged #53
Переглядів 987 місяців тому
Can AI Replace Human Creativity? | DataTopics Unplugged #53
Can AI-Generated Voices Fool Us? Insights from 'De Mol' TV Show | DataTopics Unplugged #52
Переглядів 687 місяців тому
Can AI-Generated Voices Fool Us? Insights from 'De Mol' TV Show | DataTopics Unplugged #52
Is Data Science a Lonely Profession? | DataTopics Unplugged #51
Переглядів 907 місяців тому
Is Data Science a Lonely Profession? | DataTopics Unplugged #51
Where Will GPT-4o Take Us Next? Exploring AI's Future & more | DataTopics Unplugged #50
Переглядів 2047 місяців тому
Where Will GPT-4o Take Us Next? Exploring AI's Future & more | DataTopics Unplugged #50
How Can We Define DevRel in the Tech World? Insights with Mehdi Ouazza | DataTopics Unplugged #48
Переглядів 418 місяців тому
How Can We Define DevRel in the Tech World? Insights with Mehdi Ouazza | DataTopics Unplugged #48
Tech Check: Amazon's AI, Rust vs. Go and the Intricacies of AI in Coding | DataTopics Unplugged #45
Переглядів 1599 місяців тому
Tech Check: Amazon's AI, Rust vs. Go and the Intricacies of AI in Coding | DataTopics Unplugged #45
DataTopics Unplugged #38 Open Source AI, SQL Dialects, and New Terminals
Переглядів 7810 місяців тому
DataTopics Unplugged #38 Open Source AI, SQL Dialects, and New Terminals
Dataroots nomination at Leuven Innovation Awards 2022
Переглядів 1362 роки тому
Dataroots nomination at Leuven Innovation Awards 2022
Hi, great presentation, thank you
As bayrakları🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷🇹🇷
Compensating the creators of the training data might sound like a good idea, but it breaks down in a few key areas: -Generative models can/will be created/modified/used locally by individuals and small groups, not just large companies, making enforcement difficult/impossible. -If you go so far as to ascribe authorship rights to generated works, it creates issues for licensing and copyright expiration. Additionally, the creator of the first instance of that element has likely been dead for over 70 years, which suggests the element should be in the public domain. -If the same element of a generated work is tied back to thousands (or more) of individuals, you can argue that element isn’t substantial enough to warrant copyright protection due to a lack of uniqueness.
Given that her movies are publicly available, I think it's very likely that they just went data scraping after she denied their offer.
Shitty Dutch accent
Just checking whether Murilo keeps his word on answering the comments 🙃
Hehehe I do my best 😅
this is not real time :(
Interesting thought process on whether we should care about writing dialect-inspecific SQL :) Maybe just embrace dialects and gain performance, clarity, usability, "you're not migrating everyday" - agree.
Thanks… 13:14 I have a question that why do you use precision, recall…. metrics (metrics for classification)? And how does model calculate that, because its not discrete value. I am a newbie
I think the lecturer misunderstands how pruning works. For it to be effective, there should be opportunity to continue or skip evaluation. In boosting or trees case, it's partial_fit. If you don;t utilize it, it does not matter that you receive ShouldPrune signal from the pruner: you are not exploiting it anyway, as you have already finished training & scoring. That's why with "pruning" the author had the same runtime as without it. Btw, Optuna docs suffer from initializing and partitioning data within the objective function. I don't understand why is everyone copying that without any thinking.
i know this comment is 2 years late, but does this work if i deploy my app to the internet, and it will access the users mic?
Can't run the code for tensorlow version greater than 1.15.0. How can I resolve this?
Very informative. Thanks Frederick it's a great presentation.
thanks when you run a test, the results do not look good. for those with 'interaction' equal to 1, the prediction should be close to 1. but this is not the case.
thanks. when you test, you are using data from training. i am referring to this line: long_test = wide_to_long( ) the parameter should be data['test'].. please correct me if i am wrong,
thanks. i see a problem with calling make_tf_dataset() just once for training. this function returns a size of 512 in tensor type. you are using this data just once for training. i think you need to put this into a loop. or make the batch size bigger. am i missing out in understanding?
ᎮᏒᎧᎷᎧᏕᎷ
Do we need historical data to train our mode in reinforcement learning?
Hi Arman, well you can start building your reinforcement learning algorithm from scratch, this is called online training and thus the model will learn as it sees more examples and becomes better (hence not a great model at the beginning) or if you have historical data you can use offline learning to already get a first model before using online training to improve it :)
@@dataroots so if i have historical states data, i can use this to train RL agent ? This would be called as offline training ?
My second question is, if interactions were not encoded as binary, but encoded as the actual ratings (explicit feedback rather implicit feedback), does your provided code still produce meaningful ncf_predcitons?
I believe it should (it's been a while). The only thing you want to modify is to normalize the actual ratings between 0 and 1.
I did not really understand what these ncf_predictions means for the prediction. Does higher ncf_prediction value for specific (user_id,item_id) means they should be recommended to the user? Then, during the recommendation phase, for every (user_id,item_id) pair, should I recommend the item_id with the highest ncf value to that user?
Yes, the highest predicted values that the user has not already seen/bought should be recommended. The ncf_predictions is basically the models' "guess" of whether you'd buy/watch by yourself (and we approximate "watched" = "liked").
@@murilo-cunha Thank you for the answers. Do you also have any recommendations to reduce the training time of the NCF model. I currently have 138k users and 1470 items. It takes more than days to finish the training process.
@@efesencan8079 Hmm nothing in particular to this. You can always reduce the model size (layers, embedding size, etc.), scale your training up (get a more powerful machine - GPUs, etc.) or scale out (distributed training with SparkML or something). It's a bit hard to say without more specific info. Hope this helps!
hi @Efe Sencan can you give me the link to your dataset please, i am having trouble finding one, i am also working on social media users.
dataroots- well.
very good insight!
Thanks for watching!
Good demonstration. Thanks for sharing
Thanks for watching!
Hi Vitale and everyone, great presentation so far, thanks for sharing this with me. Have you guys worked with Object Tracking models?
Hello Pierangelo, thanks for the comment! Unfortunately I’ve never worked with these models, but I think it’s a very interesting topic. For example I saw that in London a company has implemented a tracking system to keep the queue of people ordering in a pub. Do you know other particular implementations?
Nice guys ! You look like rock stars !
Thank you, loved the explaination, you covered quite a lot in very less time and also very clearly
Glad you liked it
Great content, thanks guys!
Hii! Do you any code related to real time interview app with streamlit and python
Code for this demo: github.com/datarootsio/rootslab-streamlit-demo is about voice transferring with a lightweight streamlit application, no real time interviewing
For me, this was the clearest explanation..!!
👍🏼
Sounds like a halflife song from the soundtrack
I see what you mean
Great episode! Really enjoyed it!
Glad you enjoyed it!
starting 12 min there is no content please remove it
thanks for sharing
Very useful and cool. Thank you! We use S3 in our installation <---->DataSync<--->EFS mounted in ECS
hello,thanks for this video if u can pls send me the code plz
There are some links in the description. For google colab: colab.research.google.com/github/murilo-cunha/inteligencia-superficial/blob/master/_notebooks/2020-09-11-neural_collaborative_filter.ipynb
@@murilo-cunha thanks a lot
thanks.. I have question on userid information... is it possible to provide user related information as input to model?
Yes you can. But then you are moving towards a more hybrid approach (as opposed to the collaborative filtering approach in the video).
Man, this sound quality is terrible. I can only understand maybe 20% of what you're saying. Even google captions think you're speaking German half of the time. Very disappointed.
Howdy, cowboys!
awesome I like it 😊 I have a question is there a way to put a debugger and then try some codes on the fly without rerun everything like in a notebook in python Thanks
nice explained video i don't find this path in github. can u plz help me with source link source = "datarootsio/ecs-airflow/aws"
Very good presentation.
Can you share the code you have showed during the demo to create the pipeline?
thanks , but the problem with these kind of videos is , you are talking to an expert guy who know all these things, but someone who does not know these things will not understand anything ! i hope in future videos be more detailed and slowly explain each steps not only read slides !
no one can hold your hands through everything; you need to do some research on ur own to get a feel for the context of this domain. I'd suggest you to do that first and then come back to re-watch the video.
@@jagicyooo2007 Thanks for replay , i learnt and already built a recommender system and i understood these kind of videos is wasting time ! people should learn how to implement it not just short videos and highlights .
Where I find out the code showed in this demo/live? Could you share with us?
github.com/datarootsio/rootslab-streamlit-demo
@@dataroots Thanks
How can I convert speech to text using Streamlit ?
@Anirban Banerjee try this code : import streamlit as st import speech_recognition as sr def takecomand(): r=sr.Recognizer() with sr.Microphone() as source: st.write("answer please....") audio=r.listen(source) try: text=r.recognize_google(audio) st.write("You said :",text) except: st.write("Please say again ..") return text if st.button("Click me"): takecomand()
@@PrasunChakrabortyPC thanks so much
Do you modify the similarity distance by giving a pairwise constraint to the model? What do you modify exactly when you give this new constraint? (this is more a question about semi-supervised learning and not interactive clustering)
samuel samuel does whatever samuel does can he swing from a web? no he can't he's just a devops guy watch ouuuut here comes samuel