The library that you have used mentions use of custom models in point 3 of ReadMe. But, there's no parameter by the name "model" or "num_layers". I was wondering if you have figured out what's going on there.
thanks for uploading the video. one quick question how it is different from rogue, bleu and meteor, as they are also recall and precision-based.can we use rogue, meteor and bertscore if I am evaluating chatbot and why. Please excuse if this ques sounds naive, I am very much new to this
Quick Correction : ROUGE is used for summarization and BLEU is used for translation primarily [ time stamp : 4:04 ]
My bad in rush. Thanks
thanks for all! :) tomorrow i will watch your others videos of evaluate llm and rag :)
Rock on!
great work man....thank you so much for uploading such informative videos❤❤
Thanks for valuable video as usual. Waiting for multimodal and unstructured files/applying RAG
The library that you have used mentions use of custom models in point 3 of ReadMe. But, there's no parameter by the name "model" or "num_layers". I was wondering if you have figured out what's going on there.
thanks for uploading the video. one quick question how it is different from rogue, bleu and meteor, as they are also recall and precision-based.can we use rogue, meteor and bertscore if I am evaluating chatbot and why. Please excuse if this ques sounds naive, I am very much new to this
For Text Generation RAG, Bert Score won't work?
Though, in the second example, it gave 85% similarity
Weird right?
exactly thats why his reaction is strange
yeah, not so good to evalute a llm generation... I don't blame him perahps there isn't a better way.
Thanks Man!