- 535
- 57 934
mardin mardin
Приєднався 24 лют 2014
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster LevelHuawei 2024
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level(Huawei 2024)
Переглядів: 45
Відео
Inference Scaling for Long Context Retrieval Augmented GenerationGoogle 2024
Переглядів 13День тому
Inference Scaling for Long-Context Retrieval Augmented Generation(Google 2024)
NeuralFeels with neural fields Visuotactile perception for in hand manipulation(CMU & Meta 2024)
Переглядів 22День тому
NeuralFeels with neural fields- Visuotactile perception for in-hand manipulation(CMU & Meta 2024)
VISRAG VISION BASED RETRIEVAL AUGMENTED GENERATION ON MULTI MODALITY DOCUMENTSTsinghua 2024
Переглядів 8День тому
VISRAG- VISION-BASED RETRIEVAL-AUGMENTED GENERATION ON MULTI-MODALITY DOCUMENTS(Tsinghua 2024)
BENCHMARKING MULTIMODAL RETRIEVAL AUG MENTED GENERATION WITH DYNAMIC VQA DATASET AND SELF ADAPTIVE
Переглядів 24День тому
BENCHMARKING MULTIMODAL RETRIEVAL AUG- MENTED GENERATION WITH DYNAMIC VQA DATASET AND SELF-ADAPTIVE PLANNING AGENT(Alibaba 2025)
MULTI AGENT COLLABORATIVE DATA SELECTION FOR EFFICIENT LLM PRETRAINING(HKUST 2024)
Переглядів 92День тому
MULTI-AGENT COLLABORATIVE DATA SELECTION FOR EFFICIENT LLM PRETRAINING(HKUST 2024)
UNDERSTANDING ALIGNMENT IN MULTIMODAL LLMS A COMPREHENSIVE STUDYApple 2024
Переглядів 34День тому
UNDERSTANDING ALIGNMENT IN MULTIMODAL LLMS- A COMPREHENSIVE STUDY(Apple 2024)
Data Selection via Optimal Control for Language ModelsCOAI, Tsinghua 2024
Переглядів 41День тому
Data Selection via Optimal Control for Language Models(COAI, Tsinghua 2024)
RULE Reliable Multimodal RAG for Factuality in Medical Vision Language ModelsUNC 2024
Переглядів 3614 днів тому
RULE- Reliable Multimodal RAG for Factuality in Medical Vision Language Models(UNC 2024)
Discovery of the Hidden World with Large Language Models(TMLR 2024)
Переглядів 2614 днів тому
Discovery of the Hidden World with Large Language Models(TMLR 2024)
Retrieval enhanced Knowledge Editing in Language Models for Multi Hop Question AnsweringUGA 2024
Переглядів 4021 день тому
Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering(UGA 2024)
γ−MOD EXPLORING MIXTURE OF DEPTH ADAPTA TION FOR MULTIMODAL LARGE LANGUAGE MODELSTUD 2024
Переглядів 2521 день тому
γ−MOD EXPLORING MIXTURE OF DEPTH ADAPTA TION FOR MULTIMODAL LARGE LANGUAGE MODELSTUD 2024
Probing RAG Self Probing to Guide Language Models in Selective Document Retrieval(CAU, Seoul 2024)
Переглядів 6928 днів тому
Probing-RAG- Self-Probing to Guide Language Models in Selective Document Retrieval(CAU, Seoul 2024)
PATIENT Ψ Using Large Language Models to Simulate Patients for Training Mental Health Professionals
Переглядів 42Місяць тому
PATIENT-Ψ- Using Large Language Models to Simulate Patients for Training Mental Health Professionals(CMU 2024)
Grounding Partially Defined Events in Multimodal Data(JHU 2024)
Переглядів 26Місяць тому
Grounding Partially-Defined Events in Multimodal Data(JHU 2024)
Scaling Proprioceptive Visual Learning with Heterogeneous Pre trained Transformers(MIT 2024)
Переглядів 170Місяць тому
Scaling Proprioceptive Visual Learning with Heterogeneous Pre trained Transformers(MIT 2024)
AutoTimes Autogressive Time Series Forecasters via Large Language Models Tsinghua 2024
Переглядів 85Місяць тому
AutoTimes Autogressive Time Series Forecasters via Large Language Models Tsinghua 2024
LogicPro Improving Complex Logical Reasoning via Program Guided LearningPKU 2024
Переглядів 32Місяць тому
LogicPro Improving Complex Logical Reasoning via Program Guided LearningPKU 2024
Q* Improving Multi step Reasoning for LLMs with Deliberative Planning(Skywork AI & NTU 2024)
Переглядів 101Місяць тому
Q* Improving Multi step Reasoning for LLMs with Deliberative Planning(Skywork AI & NTU 2024)
Molmo and PixMo Open Weights and Open Data for State of the Art Multimodal Models(Allen AI & UW 202
Переглядів 67Місяць тому
Molmo and PixMo Open Weights and Open Data for State of the Art Multimodal Models(Allen AI & UW 202
RuleAlign Making Large Language Models Better Physicians with Diagnostic Rule Alignment(ZJU 2024)
Переглядів 43Місяць тому
RuleAlign Making Large Language Models Better Physicians with Diagnostic Rule Alignment(ZJU 2024)
DALK Dynamic Co Augmentation of LLMs and KG to answer Alzheimer’s Disease Questions with Scientific
Переглядів 85Місяць тому
DALK Dynamic Co Augmentation of LLMs and KG to answer Alzheimer’s Disease Questions with Scientific
General OCR Theory Towards OCR 2 0 via a Unified End to end Model(StepFun 2024)
Переглядів 97Місяць тому
General OCR Theory Towards OCR 2 0 via a Unified End to end Model(StepFun 2024)
On the Diagram of Thought(Tsinghua 2024)
Переглядів 552 місяці тому
On the Diagram of Thought(Tsinghua 2024)
Quiet STaR Language Models Can Teach Themselves to Think Before Speaking(Stanford 2024)
Переглядів 1072 місяці тому
Quiet STaR Language Models Can Teach Themselves to Think Before Speaking(Stanford 2024)
Source2Synth SyntheticDataGenerationandCuration Grounded in Real Data Sources(Meta 2024)
Переглядів 102 місяці тому
Source2Synth SyntheticDataGenerationandCuration Grounded in Real Data Sources(Meta 2024)
ReKep Spatio Temporal Reasoning of Relational Keypoint Constraints for Robotic ManipulationStanford
Переглядів 602 місяці тому
ReKep Spatio Temporal Reasoning of Relational Keypoint Constraints for Robotic ManipulationStanford
Teaching Small Language Models to Reason for Knowledge Intensive Multi Hop Question AnsweringCAS 202
Переглядів 612 місяці тому
Teaching Small Language Models to Reason for Knowledge Intensive Multi Hop Question AnsweringCAS 202
Conditional generative adversarial network assisted system for radiation free evaluation of scoliosi
Переглядів 132 місяці тому
Conditional generative adversarial network assisted system for radiation free evaluation of scoliosi
In Defense of RAG in the Era of Long Context Language Models(Nvidia 2024)
Переглядів 252 місяці тому
In Defense of RAG in the Era of Long Context Language Models(Nvidia 2024)
数据集,程序在github上有吗
学生讲的很好,老师太笨了
可否给个ppt
hi~想问下贵实验室也在研究医疗影像模型么~后续有计划定期更新医疗影响方面的论文么~thanks
我们的主要工作方向
if u make content in English you would get better impression
Thanks Dalao
ppbklu 机u😊u p n n j😮lnlulmu aha z❤❤aa z za z❤aa z za z z z❤a az za z z z z aa❤a❤a z z❤❤a z a❤哇哇哇😊文盲mwkqaamaszjhaaaaaaasqjmajpaaaaqaqaaaquappppjjaaāaaaqajwwqaqsaaaqaj j mjj m w q w pa a aj j m w q w pa a a s w w w a e a❤alla w ai a a q a a sa ww wwq
這篇文章的結構真的很混亂,圖也畫得很差
接下来又是一堆人来玩弄gpt-4o的时候了。🥲
Hello, do you have the code for this paper? thank you!
Hello im an avid blender 3d user, I’m very interested in working with you on a project
很精彩的report,感谢大佬🙏
Can you please explain in English
could you please provide any guidance using it with a custom dataset
蠻好奇報告過程中的問答,希望也可以在影片中
感謝分享!
请问2分20秒左右提到的上一篇论文,是把attention维度和time维度倒置,是指的哪篇论文?谢谢
iTransformer?
English sub atleast?
很有深度的解读。爱来自广东🥰
这是把组里论文分享meeting发油管了吗) 关注了
这篇文章不错
😊
在开组会吗
Hi, I am unable to find the dataset link? Is it opensource ?
这个提问题的人好牛逼啊,哪位大佬?
浙大的组内分享么?
good work !!
ni hao world!
I wish I knew Chinese
请问有代码吗
假如model那块使用qwen14b这样小一点的模型,也能做到吗?
Thank you for your amazing research. It is so hard to find fast and reliable ways to get embeddings for music. Your work is a real savior.
These presentations you uploaded are super helpful, thank you and please keep it up!
Hey can make video in English
Good summary. Thank you for posting
可能是因为刚刚开始msc 比较菜的缘故吧 超级讨厌读paper,读着读着就困了(。 超级高兴找到了这个channel,现在读paper 像看剧一样愉快了hhhh
所以低级别到底是什么意思呢?像local edge / corner / intensity change 而不是object level 的理解吗?
嘿,小祖宗,标题为啥非得整成洋文呢?就像中国的饺子一样,馅儿香喷喷的全是肉,却偏偏取个“法式吐司”的名儿,这不是白白让人摸不着头脑吗? 看视频的人,大多数是说中文的,你说中文标题,亲切又地道,才好吸引他们点进来嘛。举个例子,你要是刷英文视频,看到一个全是英文的标题,是不是也会觉得别扭,心里嘀咕一句“这到底是哪国风味?” 再说了,用中文标题,还能顺便秀一把咱大中华的文化底蕴。来个四字成语,一两句诗词,吊足胃口,让大家伙儿都好奇得不行,非点进来一探究竟不可。 听老头的,下次做视频,标题可得跟内容配一脸,用中文说中文故事,才是正经路子!
論文名稱 + 發表機構 + 年份資訊,清楚的讓點進這影片的人知道是在講哪一篇文章,且可以讓對這篇文章有興趣的人可以快速搜尋到我覺得沒啥毛病。
@@JosephLiaw 是的,但你看,我只会说英语,当我搜索这篇论文时,我看到一个英文标题的视频。然后当我试图观看时,每个人都在说一些来自月球的难以理解的语言,就像是点击诱饵,或者大口喝了一大口可口可乐后发现原来是酱油一样。令人不快的惊喜是不愉快的。
这跟finetune有本质区别么。文章有没有跟finetune的方法做比较?感觉包装得很晦涩
🥰🥰🥰🥰
GPTs 跟assistant API 一出爐真的是搞的人仰馬翻😂
是……
讲的挺清晰的 很棒
Can I have the slide ?
谢谢大佬 收获良多!!
It would be great if you could please make this video in English.
我觉得这里并行计算的意思,应该是o的输出是并行计算的。就是o1、o2、o3的计算是并行的(只需要根据前面的k,v就可以计算出来,k,v的计算同样是并行的)。然而rnn中的隐变量不是并行计算的,其需要计算出前一个hn-1才能计算hn,是串行的
hhh学长也说了,不好意思刚没看完就评论了
really helpful content. Thanks!
你好,请教下Indicator里的Reward score是如何计算的,是使用OpenAssistant模型直接打分的吗?是用的比如 OpenAssistant/reward-model-deberta-v3-base 但是这个模型好像智能对QA场景进行打分
这是开讨论班么?小朋友被挂黑板了,哈哈哈 。。。
Would you have your presentation in an english version? Or could you enable the autotranslate option on your videos?