Can you read Chinese with 500 Characters??

  • Published 17 Mar 2023
  • Watch a native speaker attempt to read Chinese with only 500 characters. The results may shock you...
    ✅ Discover thousands of Chinese stories at Du Chinese: duchinese.net/
    Save 10% off your subscription with code: ABCHINESE
    ✍️ This video tests whether you can read Chinese with only 500 characters. We use 4 different source materials: 1) an online article, 2) an HSK 6 story, 3) a 3rd grade textbook, and 4) a WeChat conversation. All source materials are run through a software filter that replaces with "X" any character that is not among the 500 most common characters.
    👩🏻‍🏫 Check out Grace's channel: / gracemandarinchinese
    ▶️ How many characters you need to read Chinese (my opinion): • How many Chinese chara...
    ✧Support my channel for FREE by using my affiliate links✧
    🟠 Temu: temu.to/k/usqUdZaUZ00xM2O
    ⚪ Amazon: amzn.to/3D5ZVUr
    ✧✧Products I USE and ❤️(referrals/affiliate)✧✧
    📕 Favorite Chinese App: www.duchinese.net
    10% off with code: ABCHINESE
    📺 Learn Chinese w/ REAL Media: www.dong-chinese.com
    20% off with code: ABCHINESE
    💳Discover Credit Card: refer.discover.com/s/XIANGBO50
    🛡️SurfShark VPN: surfshark.deals/ABCHINESE
    📱Tello (cheap phone plan): tello.com/account/register?_r...
    📈Webull (invest in stocks): a.webull.com/KMWqe6bosPkpDLWQu9
    💻 Chinese Filter Tool (Windows application): drive.google.com/file/d/1u_DN...
    2000 Most Common Characters (in order of frequency): docs.google.com/document/d/1Y...
    📚 REFERENCES:
    Dr. Jun Da's study: lingua.mtsu.edu/chinese-compu...
    Online Article: www.toutiao.com/article/68088...
    HSK 6 Story: duchinese.net/lessons/932-a-b...
    3rd Grade Textbook: appxxkb.szxuexiao.com/html/38...
    WeChat Conversation: zhuanlan.zhihu.com/p/538025648
    ⚠️ NOTES:
    This is not a conclusive experiment and involves many variables that are difficult to control. It can only show what is potentially possible. (typo at 19:12- 慢慢地)
    💖SUPPORT MY WORK! ✈ ko-fi.com/abchinese
    ✧FOLLOW ME!✧
    🎮 Discord ✈ / discord
    📸 Instagram ✈ / _abchinese
    🎵 TikTok ✈ vm.tiktok.com/cES6Qt/
    🎹 MUSIC from Uppbeat (free for Creators!):
    uppbeat.io/t/anuch/alert
    License code: HTXLNEQEITS9IJ6Q
    uppbeat.io/t/yokonap/airplane...
    License code: M6QF5PX843NGXNDQ
    uppbeat.io/t/justin-lee/gentl...
    License code: LDEGUFX83N8NF6VD
    uppbeat.io/t/soundroll/tropicana
    License code: MSAEOHDBUNX6P5PC
    uppbeat.io/t/sensho/good-times
    License code: QG0SLLVXAK687QQT
    uppbeat.io/t/simon-folwar/so-...
    License code: ZDHNQRM1XTWFFTA3
    uppbeat.io/t/justin-lee/dawn-...
    License code: 0XSLGIAC3DPVXXCT
    uppbeat.io/t/pecan-pie/string...
    License code: NVYB7IVSTX7TTX8Z
    Fan mail ✈ DM me on Instagram or TikTok
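
The filter described in the video summary can be sketched in a few lines of Python. This is only an illustration of the idea, not the Windows tool linked above: the function names `is_cjk` and `mask_uncommon` are hypothetical, and the tiny `allowed` set stands in for the real 500-character frequency list.

```python
def is_cjk(ch: str) -> bool:
    """True for characters in the main CJK Unified Ideographs block."""
    return "\u4e00" <= ch <= "\u9fff"


def mask_uncommon(text: str, allowed: set, mask: str = "X") -> str:
    """Replace every Chinese character not in `allowed` with `mask`.

    Punctuation, digits, and Latin letters pass through unchanged.
    """
    return "".join(
        mask if is_cjk(ch) and ch not in allowed else ch
        for ch in text
    )


# Hypothetical stand-in for the top-N list (the real list has 500 entries).
allowed = set("我是的一不在人有学生")
print(mask_uncommon("我是一个大学生。", allowed))
```

With this toy set, 个 and 大 are masked while the full stop is left alone, which mirrors how the on-screen texts look in the video.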

COMMENTS • 200

  • @GraceMandarinChinese
    @GraceMandarinChinese 1 year ago +378

    It was a really cool experiment! I had a lot of fun😆 Thank you for having me!

    • @ABChinese
      @ABChinese  Рік тому +30

      So good to have you on my channel!! Thanks for collaborating with me❤

    • @genace
      @genace Рік тому +8

      Thank you @ABChinese and @GraceMandarinChinese. I look forward to possibly seeing more collaborations with you two in the future.

    • @sasino
      @sasino 1 year ago +2

      @ABChinese I absolutely agree with you on Du Chinese. Flashcards haven't helped me much, except in the beginning when I studied the basic characters. Reading content, however, has helped me rank up very fast: I learned more than 1000 characters in 6-8 months. Then I took a break of a few months, but I can still read content now.
      Reading content is the key, in my opinion. I'm subscribed to Mandarin Bean and Du Chinese.

    • @caoimhinyay
      @caoimhinyay 7 months ago

      omg you're my fav youtuber thank you so much 😭

  • @wasniz7566
    @wasniz7566 Рік тому +87

    I see Grace I click. Great to hear her casually speaking Chinese 😂

  • @cmaven4762
    @cmaven4762 Рік тому +112

    Something I realize is how much the source text can influence how easily one can understand. An example: 滚 is #1572 on the frequency chart you used, yet it is one of about 80 symbols I currently recognise because it gets used so frequently in Cdrama to represent "scram", "get out of here" or other less polite invitations to exit the scene... lol ... Meanwhile frequently used words like 国 [#20] are not on my list because Cdrama characters don't use them as much.
    This is a fascinating experiment ... thanks to your dad and Grace for making it possible!

    • @analogpark8059
      @analogpark8059 Рік тому +11

      That's a good point about 滚. I just watched a movie today where ppl kept saying 'gun' (didn't see the character), and your comment reminded me to look it up-- lo and behold, the very same word!

    • @7kaisheba
      @7kaisheba Рік тому +1

      ❤ love that she did this experiment!

    • @brunocardoso7132
      @brunocardoso7132 Рік тому

      what was the frequency chart he used??

    • @cmaven4762
      @cmaven4762 1 year ago

      @brunocardoso7132 He posted a link to it in the description.

    • @commenter4898
      @commenter4898 Рік тому +8

      The thing is, with any frequency list you are basing it on some corpus, but word frequencies actually depend a lot on the corpus. Drama, chat, technical document, news report, youtube comment, workplace, etc all demand different vocabularies. The only characters that really are used in every context at high frequency are pronouns, numbers, prepositions, auxiliary verbs and grammatical particles.

  • @aafrophonee
    @aafrophonee Рік тому +71

    This was a really fun video! I'm a big fan of Grace's videos, so it was great to see this collab. I'm also a DuChinese premium user, I love that app. I think a video about the details that went into your experiment would be pretty interesting

    • @ABChinese
      @ABChinese  Рік тому +4

      You got all the best resources;)

  • @pred4507
    @pred4507 1 year ago +16

    There are 2663 individual characters in the HSK 6 vocab (5000 words made up of 9662 characters in total). You can have a very big vocab with 500 individual characters. I guess 1000 is about HSK 5 level. Edit: I thought this was quite interesting, so I did some coding.
    You are able to form:
    - 1175 words with 500 chars
    - 2413 with 1000
    - 3520 with 1500
    The word list is the HSK 6 one. Of course, a native speaker will be able to make up even more words with the given chars.
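
A count like the one in this comment can be reproduced along the following lines. The commenter did not share their code, so this is only a guess at the approach: a word counts as "formable" when every one of its characters is among the first `n` entries of a frequency-ranked character list. The function name `count_formable` and the toy data are hypothetical; the real inputs would be the HSK 6 word list and the frequency ranking linked in the description.

```python
def count_formable(words, ranked_chars, n):
    """Count words whose characters all fall within the first n ranked characters."""
    known = set(ranked_chars[:n])
    return sum(1 for word in words if all(ch in known for ch in word))


# Toy data only: NOT the HSK 6 list and NOT the real frequency ranking.
ranked_chars = list("的一是不了人我在有他")
words = ["我", "不是", "有人", "他人", "我的"]

for n in (3, 6, 10):
    print(n, count_formable(words, ranked_chars, n))
```

As in the comment, the count rises as the character budget grows, because more multi-character words become fully covered.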

  • @susamirain
    @susamirain Рік тому

    Great video! Really loved seeing Grace again here! Yay! Keep these videos coming.

  • @MaryGouge
    @MaryGouge Рік тому +2

    Great video! I really loved seeing you two collab, my fav two chinese language teachers

  • @KaMi-gz1il
    @KaMi-gz1il 1 year ago +31

    This was a very interesting experiment! I feel more motivated to continue my studies, thinking I won't need a ton of characters to understand Chinese content. Also, just this past week I started looking for a better language learning app, since I paid for Duolingo last year and I don't think it did much to improve my knowledge. I just got Du Chinese and it's very interesting. I will stick with this one this year! Thank you for the very helpful content! Hopefully this year I make it to HSK 3 :')

  • @microcolonel
    @microcolonel Рік тому +11

    I have experienced being barely literate in this way, and can confirm you can enjoy reading at this level of vocabulary.
    Also a big fan of Grace, nice to see her here. :+ )
    I have made pieces of software like your father did, I have been cleaning up *word* lists rather than character lists as well.

  • @seanconnor8166
    @seanconnor8166 Рік тому +8

    Great vlog. This experiment makes me much more optimistic about the possibility of learning to read Chinese. Great insights!

  • @Speechbound
    @Speechbound 8 місяців тому

    You've got a new subscriber, thanks for the awesome content! 你的视频非常有趣

  • @MrBrunoMi
    @MrBrunoMi 3 місяці тому

    great video. very encouraging for a learner! 非常谢谢

  • @erikl.1860
    @erikl.1860 Рік тому +14

    I like you! This is a very interesting video.
    I have studied Chinese for 8 years, about 20 years ago. I guess we learned about 1500 to 2000 characters, but I am still not fluent at all. However, when I travel to China, I can travel around without any problems. Chinese is such an interesting language!

  • @dottieshields5918
    @dottieshields5918 Рік тому

    Thanks for this Andrew.

  • @ritasallai152
    @ritasallai152 5 days ago

    This is great! Many thanks to you and to whoever made this computer program! It is superb❤!

  • @zeynepsar9550
    @zeynepsar9550 Рік тому +3

    As a Chinese learner I enjoyed this video a lot. This is the collab that I didn't know I needed 😄

  • @arichichi
    @arichichi Рік тому +12

    It confirms my fear. Like, maybe you don't need to know how "ice" is said in any language. Yet, if you have to deal with it, even though it's so rare, you'd miss your target completely. Rare words get their revenge by providing a lot of context that you otherwise will never be able to manage to understand.

    • @xuexizhongwen
      @xuexizhongwen Рік тому +3

      Ice is a rare word?

    • @arichichi
      @arichichi 1 year ago

      @xuexizhongwen It ranks 2096 according to Wiktionary, so yeah, it's rare

    • @Alesti5
      @Alesti5 Рік тому +1

      @WHENDOESITEND? lmao at that point you’re better taking 15sec to look up that word

  • @erinire
    @erinire Рік тому +6

    Your editing improves every upload, it’s definitely noticeable keep up the good work

  • @DecaSpace
    @DecaSpace 11 місяців тому

    You are an excellent content creator! Keep up the great work and energy!

    • @ABChinese
      @ABChinese  11 місяців тому +1

      Thank you! I'll keep trying:)

  • @deacudaniel1635
    @deacudaniel1635 1 year ago +3

    This is one of the most interesting videos about the Chinese language I've ever seen on YouTube. I didn't know about the program you mentioned at the beginning of the video, so I copied the 现代汉语常用字表 into Word and deleted the characters I didn't know, so that Word would count the number of characters I can read 😂. The result was around 2100 characters, but I would say I can recognize only around 2000 because some of them are not very clear to me. Even with this number of characters, I still regularly see new ones that I can't recognize.

  • @TobiasBalk
    @TobiasBalk Рік тому

    I love to see unexpected collabs between two channels I follow 😁😁😁😁

  • @fabulouschild2005
    @fabulouschild2005 Рік тому

    Your videos are great ❤

  • @williamjohn6939
    @williamjohn6939 Рік тому +1

    Amazing experiment! You rule!

  • @TheGhostPlanet
    @TheGhostPlanet 1 year ago +1

    Your videos are always entertaining and to the point. Keep up the great work. 😊

    • @ABChinese
      @ABChinese  Рік тому +1

      Thank you for watching! I try my best;)

    • @TheGhostPlanet
      @TheGhostPlanet 1 year ago

      @ABChinese you do a great job 👍🏻👍🏻

  • @genace
    @genace Рік тому +12

    Awesome experiment. We’ve all heard that we need to know a certain number of characters, but this is the first time I’ve seen it actually tested out. I know you’ve been working hard on this video and it turned out great. I enjoyed watching🙂Thank you.
    btw that’s a very cool transition at 6:43!

    • @ABChinese
      @ABChinese  Рік тому +2

      Thanks for watching Josh!

    • @danielzhang1916
      @danielzhang1916 Рік тому +1

      the last test would have been better at 300 words for context

  • @kylosun
    @kylosun Рік тому

    Awesome video top content as always!! - KS

  • @ku9305
    @ku9305 Рік тому

    This was super interesting, thanks for putting in the hours to create this video!

  • @ellotheearthling
    @ellotheearthling Рік тому +11

    Mandarin is a beautiful language

  • @zacharymccann4138
    @zacharymccann4138 Рік тому

    Great content 🎉

  • @drl3247
    @drl3247 2 місяці тому

    哇! 非常有意思!谢谢。。。where can I obtain that filtering software?

  • @bandinopla
    @bandinopla 18 days ago

    Super interesting to see visually what the number of characters you know does for your reading.

  • @TheAnimeq
    @TheAnimeq Рік тому

    Great experiment, well done to both of you! Everybody says - input input input, yeah you just need to read, good there are some online materials to start from (DuChinese you pointed, MandarinBean, some phone apps etc). Hope to be able to read normal books any time soon :)

  • @jim37569
    @jim37569 Рік тому +7

    I signed up for duchinese after your first video about it but I totally forgot to use your signup code. Sorry about that! But I really appreciate the recommendation, it totally rules. I only wish they had things listed using HSK 3.0 levels, but that's a minor complaint.

  • @Hacktheplanet_
    @Hacktheplanet_ 1 year ago +1

    That's it! I'm adding the top 500 most frequent Chinese characters to my Anki deck alongside my HSK 3!

  • @sunofsakura3143
    @sunofsakura3143 5 місяців тому +1

    Thanks!

  • @josephbornman8462
    @josephbornman8462 Рік тому +2

    Having the plot presented to you and then having native chinese speech be in this plot is really fantastic for getting in the groove of the language.
    So much better than if she had given her thoughts in English

  • @flaviospadavecchia5126
    @flaviospadavecchia5126 Рік тому +1

    Fascinating experiment! I think the 3rd grader texts were probably quite difficult and not all that common hahaa I would have tried also some novels and simple stories and longer chats.

    • @ABChinese
      @ABChinese 1 year ago +2

      Grade school textbooks have different vocabulary from HSK, since HSK focuses more on functional vocabulary and textbooks tend to focus more on literature, including old literature. I would definitely have tried more texts, but this video was already 20 minutes long with 4 texts, so... I don't think people would watch a longer video.

  • @liznludo
    @liznludo Рік тому

    This was so interesting, thanks! I am still scared to learn, but I feel braver. A bit 🤣

  • @dietrichdietrich7763
    @dietrichdietrich7763 1 year ago +1

    Hey, that software is really needed. When can we get it publicly? I like language software, and there isn't enough of it for PC.

    • @ABChinese
      @ABChinese  Рік тому

      Check the post I made in my community tab!

  • @sangyoonsim
    @sangyoonsim Рік тому +8

    I'm South Korean and Korean has some Chinese loan words.
    I think I'll be able to read Chinese IF I learn those 500 characters!

    • @ruffhakes7419
      @ruffhakes7419 Рік тому

      Don't South Koreans learn hanja at school as part of the curriculum?

    • @as2s3hf7gff
      @as2s3hf7gff 1 year ago +1

      @ruffhakes7419 But they read content, school material, etc. in Hangul. That's very different from Chinese and Japanese people, who apply their knowledge of Chinese characters in real situations.

  • @garyd.8249
    @garyd.8249 11 months ago +3

    Feels like a Chinese speaker reading Japanese articles. You don't understand the purely Japanese parts, but you can still get the meaning from the remaining Chinese characters.

  • @LudicrousTachyon
    @LudicrousTachyon Рік тому +12

    I'm wondering if characters is the best way to select what's readable as opposed to words like is generally done for other languages. I'd like to see this same experiment but with words rather than characters. For example, take the characters that are in the top 500 words and just allow those. The number of characters certainly won't match and may even be less than 500, but the set might be different from the 500 most common characters. I'm curious how that affects readability.
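
The word-based selection proposed here could be prototyped roughly as follows. This is a sketch under assumptions, not anything used in the video: `chars_from_top_words` is a hypothetical helper, and both ranked lists are toy data standing in for real word and character frequency lists.

```python
def chars_from_top_words(ranked_words, n):
    """Collect the distinct characters used by the n most frequent words."""
    chars = set()
    for word in ranked_words[:n]:
        chars.update(word)
    return chars


# Toy data only: hypothetical stand-ins for real frequency lists.
ranked_words = ["的", "我们", "中国", "可以", "我"]
ranked_chars = list("的一是不了人我在有他")

word_based = chars_from_top_words(ranked_words, 5)
char_based = set(ranked_chars[:5])

print(len(word_based), "characters come from the top 5 words")
print("shared with the top 5 characters:", word_based & char_based)
```

The resulting set could then be passed to the same kind of masking filter as the allowed list, which would let the two selection methods be compared on the same text.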

  • @sheph-zb6uv
    @sheph-zb6uv Рік тому +1

    wow i have hope now thx

  • @SabineLeppanen_Art
    @SabineLeppanen_Art 5 місяців тому

    DuChinese is fantastic!

  • @Fabio-dn3fx
    @Fabio-dn3fx Рік тому

    This was a very cool thing to visually see how important characters are (of course!). But honestly my problem isn't the characters themselves, I love learning them, but the words!
    Yeah of course I know the character 的 for example, but I may not know the WORDS 目的、的確、地款等

  • @atruebossawbw
    @atruebossawbw Рік тому +1

    This is amazingly insightful! Is there anywhere I can download the software? I want to try the same experiment with Japanese.

    • @ABChinese
      @ABChinese  Рік тому +3

      I can ask my dad if we can make it a downloadable thing

    • @atruebossawbw
      @atruebossawbw 1 year ago

      @ABChinese Yes please! (;^ω^)

    • @w9316
      @w9316 Рік тому

      have you completed the experiment? I know some 700-800 characters and can read simple texts in Japanese trivially and mostly extract the vital information from intermediate texts, but stumble pretty hard on any slightly advanced or specialised topics. I'd be interested to hear your experiences.

  • @chuchi9935
    @chuchi9935 Рік тому

    I love this video

  • @treelineresearch3387
    @treelineresearch3387 Рік тому +2

    I'm interested in learning to read some Chinese mostly so I can more quickly scan through electronics vendor websites and part datasheets, anyone have any tips? Should I be focusing on studying that kind of technical content using tools like the Zhongwen browser plugin, or would it be more effective to start with more traditional "beginner" type content? Output, writing/speaking, isn't as important to my goal as input, but would it be substantially beneficial to practice anyway?

    • @ABChinese
      @ABChinese  Рік тому +2

      The beginner stuff will be useful to start, because you'll have to learn the foundational concepts no matter which route you eventually go

  • @ExplainDigital
    @ExplainDigital 1 year ago +2

    Please make a video where you take a random Japanese article, try to translate it into English, and then, after checking the real meaning, say what percentage you understood. It would be even cooler if you could do the reverse with a native Japanese speaker reading Chinese.

  • @steven-qm8ic
    @steven-qm8ic 6 місяців тому

    Do you have a frequency list for vocabulary?

  • @vangod9831
    @vangod9831 1 year ago

    In your calligraphy videos, what is the name of the paper pattern that you use for writing the characters? Thank you for the vids

    • @ABChinese
      @ABChinese  Рік тому +1

      Honestly, I’m not sure what’s the “proper” name for it. I just call it a character practice grid with the 米 grid pattern (cuz it looks like that Chinese character)

  • @user-bc9fe7pd9r
    @user-bc9fe7pd9r 10 months ago

    Pareto distribution?
    In the real world.
    It's amazing that your work exists, and you're just super awesome!!!!
    1000/10 rating %

  • @pptzzx
    @pptzzx 1 year ago +1

    A very creative show

    • @ABChinese
      @ABChinese 1 year ago +1

      A longtime fan~~ thanks for watching

  • @kamiyama-chairdesklamp
    @kamiyama-chairdesklamp 1 year ago +2

    This is interesting to me as a native Japanese speaker (our daily-use characters number 2000-2100) who is learning Cantonese. I often find the biggest hurdles are when we use the characters differently from their original meaning, and then I sort of get stuck and overloaded like overtaxed RAM (if that weren't a reasonable metaphor, 電腦 wouldn't be a word).

    • @danielzhang1916
      @danielzhang1916 Рік тому

      it might be easier to start with Mandarin, because Cantonese created their own characters, then you can go on

    • @samaval9920
      @samaval9920 5 months ago

      @danielzhang1916 Probably all the other dialects too.

  • @nekomancer4641
    @nekomancer4641 1 year ago

    The way Chinese characters use 偏旁 (components) to build more complex characters must really help too. Even if you don't know a character, you might know a part of it and so make some sense of it.

    • @danielzhang1916
      @danielzhang1916 Рік тому

      although some characters have very different pronunciation with different parts

  • @sander_bouwhuis
    @sander_bouwhuis Рік тому

    Great video!
    Indeed, the most important take away is that 500 characters is NOWHERE near enough for understanding texts. That native girl simply knows pretty much all Chinese words, so she can guess what it could mean. I'm currently doing HSK2 and would have trouble with these texts even though I know far more than 500 characters.

  • @salvadorsanchez5057
    @salvadorsanchez5057 Рік тому +2

    i think it makes a lot of sense. the most frequent 150 convey almost no meaning because theyre almost just grammar, and grammar is necessary in every single text so of course theyre the most frequent ones.
    after those grammar bases are set, the first 350-650 characters are all going to start being super important to start conveying meaning, and like grace said eventually they all become rarer and more specific.

  • @Hacktheplanet_
    @Hacktheplanet_ Рік тому

    Nice programming!

  • @yidminselaks
    @yidminselaks 1 year ago +4

    There is a big problem with this test: native speakers already know which characters make up a word, so if they see a word with only one character visible, they can fill in the missing characters with the help of context. That isn't possible for non-native speakers, who won't know or understand words that contain characters outside the 500 they know. E.g. 双XX, as in the video, could easily be filled in as 双胞胎 by a native speaker, but a non-native speaker who doesn't know which characters make up the Chinese word for twins probably couldn't guess the word.

    • @ABChinese
      @ABChinese  Рік тому +2

      Yeah, I realized that was one problem, but after reading your comment, I suddenly thought of a way I could've fixed it. maybe? If I make every string of unknown characters just one "X" then she wouldn't be able to tell how many characters there are and can't guess based on length of words. So like 双胞胎 would be 双X instead of 双XX.
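
The fix suggested in this reply, collapsing each run of masked characters into a single "X", can be sketched with a regular expression. This is an illustration of the idea only, not a feature of the released tool; `collapse_masks` is a hypothetical name.

```python
import re


def collapse_masks(masked: str, mask: str = "X") -> str:
    """Replace each run of consecutive mask characters with a single mask."""
    pattern = "(?:" + re.escape(mask) + ")+"
    return re.sub(pattern, mask, masked)


print(collapse_masks("双XX出生了"))
```

One caveat: any literal Latin "X" already present in the source text would be collapsed too, so a mask symbol that cannot occur in the text would be a safer choice.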

  • @catv2184
    @catv2184 Рік тому +1

    as a non-native, i think it's worth mentioning that even the words we don't understand can sometimes have clues for us to guess their meaning (ex: two characters words in which we know one of the characters, radicals, certain clues of whether it's a noun, adverb, adjective etc). so yeah we can probably read even better than you expect with only 500 characters

  • @squalllfviii
    @squalllfviii Рік тому +1

    I currently know probably about 650-700 characters. If I can read that much from knowing 1,000, I can't wait til I can read at least 1,000.

  • @nathansalyer
    @nathansalyer 7 місяців тому

    It was nice to see the 150 since that's about where I'm at, just a third of the way to 500 😂

  • @Theo-oh3jk
    @Theo-oh3jk 1 year ago +2

    So, ~100 characters account for things like numbers, grammatical particles, and common affixes. ~500 characters account for those plus very common basic nouns and verbs. Together, these account for ~75% of the text you encounter. This is true, but mostly it isn't helpful, because you will still be missing the most important (and statistically much rarer) nouns and verbs that are the key content.
    Every text will be different. For example, the very basic, more or less artificial language in textbooks or graded readers is designed to make sense with a low character count. Social media also tends to be pretty basic. Anything more, though, and you will quickly discover that you *need* those rare content words to understand what you are reading or hearing!
    Studies of reading comprehension show that if you understand less than 90% of the text, you will not understand it and will be frustrated. If you understand between 93% and 97% of what you read or hear, you can get the message with a lot of work. If you understand about 98% of the message, you can figure out the meaning of any words you don't know. So, if you have a text of 100 characters and you understand only 90, odds are that you are missing those key rare content words. If you assume an upper conversational-level text that uses about 2000 characters, then you need to be able to understand ~1,800 of them to potentially muddle through towards understanding the message.
    This is all complicated by the fact that every text differs, not only in its level, but in how many words (as opposed to characters) there are in the text, along with how much other context, like pictures, tables, or graphs, it contains. In my experience, social media is really easy to get, but even trying to read the front page of a newspaper is really challenging at my level.

  • @jeffreysetapak
    @jeffreysetapak 1 year ago

    You adorable little darling, you teach Mandarin Chinese pretty well. Hugs!!

  • @user-rz1wg8fn1m
    @user-rz1wg8fn1m 7 months ago

    Can you do it again with a Chinese learner at a different level?

  • @pierregagnon2666
    @pierregagnon2666 Рік тому +2

    Heyyyy it's Grace . I'm one of her subs too. 欢迎来到频道!

  • @JimNH777
    @JimNH777 8 місяців тому

    Let's imagine you do something like this with English vocabulary: 'if you find this video interesting please subscribe' turns into 'if you find this -- -- please -- ' - And that's way more than 50% of the sentence ;)
    BUT what's so powerful about conversations is that you can rephrase it as 'if you like what I do, please follow my work', which has the same meaning using only the most basic vocabulary. When it comes to reading, there's no person on the other side to rephrase anything, and that's what makes it much harder ;)

  • @cmyk8964
    @cmyk8964 Рік тому

    So Thing Explainer by Randall Munroe is possible in Chinese with relatively few concessions?!

  • @PHH81
    @PHH81 Рік тому

    interesting!

  • @mapleleaf4ever
    @mapleleaf4ever Рік тому

    That is pretty impressive with only 500 characters.

  • @RandomBb56
    @RandomBb56 Рік тому +1

    if you are a native speaker, you can sometimes even mess up the order of the words in a sentence and you can still understand its meaning.

  • @zhubajie6940
    @zhubajie6940 5 months ago

    Agreed, it's the number of words, not the number of characters, that is important! Sometimes you can guess the meaning (冰茶, iced tea), be partly right (红茶, literally "red tea" → black tea in English), or not have a clue (清淡, literally "clear weak", which means food that is not spicy or greasy).

  • @xuexizhongwen
    @xuexizhongwen Рік тому +8

    Interesting experiment, but for the reasons you mentioned in the video, it tells you absolutely nothing about how much a learner would understand. That would depend mostly on how many WORDS the learner knows. I bet if you found a learner who has only learned about 500 characters and gave him the test, he would understand very little. Of course, it also depends very heavily on what the text is.
    I have no idea how many characters I know, but I think at least 2000. (But there are probably also a lot that I wouldn’t know individually, but know in context.) And it is still not enough to understand everything I read. A learner just can’t compare to a native speaker in an experiment like this. Also, I never understood the focus on characters instead of words. Like the example you gave, if you see the word 存在, just knowing the character 在 would be of no help to you at all. This gets much worse at slightly higher levels. Also, what does “knowing a character” even mean? Does it mean you know how to pronounce it and its basic meaning? I think if you don’t know all the various meanings it could have in different contexts, you don’t really know it. You’ve only started to come to know it.

    • @ABChinese
      @ABChinese  Рік тому +7

      I completely agree with you and had the same confusion on why people don't count words. I actually contacted Dr. Jun Da and asked him that question in passing. He told me the reason people don't count words is because it's almost impossible with the nature of Chinese and current technology. Since Chinese doesn't use spaces, and each character serves as a morpheme (that also CAN be a word), it's very difficult for machines to identify "words." There's a formula he used in his study to estimate the number of words in his corpus, but it's only an estimate. Who knows, maybe when AI takes over, we'll finally be able to count Chinese words!

    • @tebby24chinese
      @tebby24chinese 1 year ago

      @ABChinese Chinese parsing is actually quite accurate. I've used a Python library called 'jieba' in some projects. Even if the accuracy isn't perfect, it's more than enough to get a solid estimate of a word count.
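
To show why dictionary-based segmentation makes word counting feasible even without spaces, here is a sketch of forward maximum matching in plain Python. This is a deliberately simpler technique than the one `jieba` uses, swapped in so the example has no third-party dependency; the function name `segment` and the toy dictionary are hypothetical.

```python
from collections import Counter


def segment(text, dictionary, max_len=4):
    """Forward maximum matching: at each position take the longest dictionary word.

    Characters that start no dictionary word are emitted as single-character tokens.
    """
    words, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in dictionary:
                words.append(piece)
                i += size
                break
    return words


# Toy dictionary: a real segmenter ships a lexicon with hundreds of thousands of entries.
dictionary = {"双胞胎", "出生", "存在", "我们"}
tokens = segment("我们存在双胞胎出生", dictionary)
print(tokens)
print(Counter(tokens))
```

Greedy matching like this fails on ambiguous strings, which is the difficulty described in the parent reply; statistical segmenters reduce those errors but do not eliminate them.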

  • @WCris99
    @WCris99 Рік тому

    damn the 6:38 shot is so cool 🤯

  • @caigou
    @caigou Рік тому

    6:57
    哈喽同学
    图书馆
    学生证掉了
    7:10
    谢谢啦
    (idk)
    两杯起送
    12:00
    那么
    晚上
    考试
    (next page)
    复习
    那好的
    考试重要
    大题
    不用了
    17:50
    居然在饭堂也能遇到
    那么特别的缘分
    (in Sticker)男大学生
    有点巧
    (idk)
    下次约你
    (next page)
    下次是什么时候?
    不是吧
    好吧

  • @mayblu
    @mayblu 4 місяці тому

    that actually freaks me out that u used an article abt twins being born 87 days apart because i just read an article about that earlier. and it wasn't new, i thought about it and searched for it LOL

  • @Rayenn_19
    @Rayenn_19 8 місяців тому

    13:10 Timestamp for my future list

  • @ize1000009
    @ize1000009 11 months ago

    Pareto sends their regards 😎
    But the thing with Chinese is that characters do not equal words. You need to know both the characters and their combined meaning in words of 2+ characters.
    I know 1500 characters, but I can't read many things even when I know all the characters in a text, because I don't know the words in the text yet. Obviously Grace knows a lot of words as well as characters.
    Edit:
    Finished the video, and I see you made exactly the same points 😂

  • @zhubajie6940
    @zhubajie6940 5 місяців тому

    Based on an approx. trend (the inverse difference formula with R^2=.9996) you get the number of characters compared to understanding:
    Characters   Understanding
    150          23.5% (20% she said)
    500          79.6% (80% she said)
    1000         90.0% (90% she said)
    1969         95.0%
    2000         95.1%
    2663         96.3% (HSK 6)
    3916         97.5%
    9756         99.0%
    97350        99.9%
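
The comment does not spell out its formula, so the following is only a guess at its shape. The sketch assumes a two-parameter inverse form, understanding = 100 - c / (n - d), and solves for `c` and `d` from the 500 and 1000 rows of the table. The function names are hypothetical.

```python
def fit_inverse(p1, p2):
    """Solve y = 100 - c / (n - d) so that it passes through two (n, y) points."""
    (n1, y1), (n2, y2) = p1, p2
    g1, g2 = 100 - y1, 100 - y2  # remaining gap to 100% at each point
    # From g1 * (n1 - d) = g2 * (n2 - d), solve for d, then for c.
    d = (g1 * n1 - g2 * n2) / (g1 - g2)
    c = g1 * (n1 - d)
    return c, d


def predict(n, c, d):
    """Estimated understanding (in percent) when n characters are known."""
    return 100 - c / (n - d)


c, d = fit_inverse((500, 79.6), (1000, 90.0))
for n in (1969, 2663, 3916, 9756):
    print(n, round(predict(n, c, d), 1))
```

Under this assumption the curve lands within about 0.1 percentage points of the table for 500 characters and above, but it gives roughly 25% at 150 characters rather than 23.5%, so the commenter's actual fit evidently used somewhat different parameters.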

  • @Luiseut59
    @Luiseut59 Рік тому

    Can you do this experiment with Japanese?

  • @forgetmenot2512
    @forgetmenot2512 Рік тому

    I wish there was that list of those common characters - I was waiting for it and it isn’t there…

    • @ABChinese
      @ABChinese  Рік тому +3

      Hi~ here it is: lingua.mtsu.edu/chinese-computing/statistics/
      I used the "Modern Chinese" list and the link is also in the description

  • @silverchairsg
    @silverchairsg Рік тому +1

    Realistic version: The Chinese teacher deducts 1 mark for every word you don't know or mispronounce, and then you fail your Chinese oral, and you get put in Chinese remedial class and have to stay back after school. Then your parents look at your grades and say "Never mind boy I also scored F9 for Chinese in my time, I will send you for Chinese tuition class."

  • @McDucknald
    @McDucknald 7 місяців тому

    Poor A is trying so hard with B lmao

  • @Ironfist85hu1
    @Ironfist85hu1 Рік тому

    I wish every chinese video on the internet would use the same type of subtitles.

  • @rodrigoappendino
    @rodrigoappendino 11 місяців тому +1

    13:36 Now she knows how I feel when I try to read something in chinese. hahaha

  • @ParagonPKC
    @ParagonPKC Рік тому

    I need that software so I can find the commonly used characters I don't know on my own

    • @ABChinese
      @ABChinese  Рік тому +1

      Check my latest post under the community tab! You can download it there

    • @ParagonPKC
      @ParagonPKC 1 year ago

      @ABChinese tysm!!

  • @yef66
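    @yef66 1 year ago +1

    Chinese character coverage:
    1. Core characters: 300 characters, 67% coverage ❗️👍
    2. Basic (high-frequency) characters: 600 characters, 80% coverage (including the 300 core characters)
    3. Mid-frequency characters: 400 characters. High + mid = 1000 characters, and coverage reaches 90%!
    4. Low-frequency characters: 1500 characters. High + mid + low = 2500 characters, and coverage reaches 99% ❗️👍
    5. Ultra-low-frequency characters: 2500 characters, covering only 1%! 😂 All 5000 together reach 99.99% coverage (including Classical Chinese)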

  • @nitsum8874
    @nitsum8874 4 місяці тому

    hey there! quick question, do you think it's possible to learn how to read chinese, but not actually know the language ? learn the meaning of the characters and group of characters, but never know how to pronounce them ? or is there something I'm missing out that would prevent this ?

  • @mikedaniels3009
    @mikedaniels3009 11 months ago

    Dear friends: this is a fantastic video, thank you all.

  • @yunyung
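    @yunyung 1 year ago +1

    I can speak Mandarin but can barely read, which surprises a lot of my Chinese friends. Even though I'm a university student who is busy around the clock, I still find time to improve my Mandarin. So it really is important for parents to send their kids to Chinese school!!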

    • @user-fs1sb4mn2b
      @user-fs1sb4mn2b 1 year ago

      Are you Taiwanese? Hahahaha

    • @yunyung
      @yunyung 1 year ago

      @user-fs1sb4mn2b No, a Taiwanese person wouldn't be using simplified characters

  • @yeroca
    @yeroca 8 months ago

    Just to play devil's advocate here, I think native speakers will naturally fill in the missing words, because they will often know them from the context, and will also know intuitively from the sentence grammar or pattern whether the missing word is a name, adjective, or verb.
    So native speakers have an advantage in reading with fewer words, and it's not clear how much that skews the overall hypothesis of your experiment.
    I think it would be better to test on someone who has only learned the 500 characters.

  • @silafuyang8675
    @silafuyang8675 Рік тому +1

    With 500 characters, you can't even read kindergarten books.

  • @WheeJones
    @WheeJones Рік тому

    0:40 Where is that setting? It looks like a middle school library

    • @ABChinese
      @ABChinese  Рік тому +1

      It is a public library haha

    • @WheeJones
      @WheeJones 1 year ago

      @ABChinese Ohh ok lol

  • @DemonFox369
    @DemonFox369 Рік тому

    She can’t explain it without more than 500 characters if she didn’t know more than 500

  • @simoroshka
    @simoroshka Рік тому

    I don't know Chinese, but I guess some words are made up of more than one character? Then wouldn't you need to know the meaning of combination in addition to the single characters or would you be able to guess?

    • @ABChinese
      @ABChinese  Рік тому

      Right, so a character in Chinese is a morpheme, it can be a word or can be part of a compounded word. A native speaker can kind of guess even if some characters are missing because the characters carry meaning even by themselves.

  • @alekseev1986
    @alekseev1986 Рік тому +1

    I need 7777+ characters 😂

  • @pistrov8150
    @pistrov8150 6 місяців тому

    15:10 literally me when trying to read Chinese but it’s a little bit better

  • @zdzislawmeglicki2262
    @zdzislawmeglicki2262 Рік тому +1

    The Chinese writing system would be so much easier to master if all or most of its characters were ideograms. Why, they could even be internationally adopted and used in public signage around the world. But the phono-semantic compounds, which make for most Chinese characters, ruin everything. The phonic part helps, if marginally only, those who speak the language already, but it's of no use to those who don't.

    • @danielzhang1916
      @danielzhang1916 Рік тому

      it doesn't work like Hangul or others where you can figure it out like that

  • @paper2222
    @paper2222 Рік тому

    13:51 it felt like all she read was "from the ... of the ... to the ... of ..."

  • @user-ig2qk5iz7d
    @user-ig2qk5iz7d 1 year ago

    Just to point out, ordinary university graduates do not know 8000 characters. Even those graduating from language or history majors usually recognize only about 5500. 8000 is scholarly: approximately all the characters in the Shuowen.