#2 Introduction to Corpus Linguistics - Types of Corpora

Science and Technology

Hello there! In this video I present to you some of the most common types of corpora. This is by no means an exhaustive list but it's enough to get you started :)
If you want to use any of my videos, kindly let me know :)
Corpus annotation: Use TagAnt to tag your corpus for Part of Speech. (I'll talk about this in detail in future videos)
Download TagAnt: www.laurenceanthony.net/softw...
View Part of Speech Tags online: courses.washington.edu/hypert...
Contact me at: yassine.iabdounane@gmail.com

Comments: 59

  • @YassineIabdounane
    4 years ago

    Sample Corpora 02:25
    Corpora for Comparison 05:13
    General Corpora 09:50
    Specialized Corpora 10:58
    Annotated Corpora 11:35
    Unannotated Corpora 17:11
    Learner Corpora 17:50

  • @ghmarioumaima5391

    4 years ago

    Hello! I want to email u but i couldn't find ur email. Would u please Write it for me and thanks.

  • @YassineIabdounane

    4 years ago

    @ghmarioumaima5391 Hi Oumaima! Sorry about that. There you go: yassine.iabdounane@gmail.com

  • @ikramullah3484
    3 months ago

    I am new to the field of Corpus Linguistics. I am learning so many things from your videos. Thank you for sharing such informative videos.

  • @miromarita3631
    7 months ago

    God bless you sir I'm so grateful for learning this gorgeous lesson Thank you so much 🥰❤️

  • @0101799
    a month ago

    Thank you for giving us insightful and organized lessons about corpus linguistics!

  • @0101799

    a month ago

    Also, it was very cute of you showing the "Helsinki" in the la Casa de Papel!!!!

  • @itsjustme5176
    8 months ago

    Thank you so much, clearly explained. I'm doing my master's degree in Spain and corpus linguistics is a new concept to me.

  • @user-np1kb2pw6e
    8 months ago

    Plz sir keep sharing your knowledge with us ❤

  • @ferroumsamir6531
    a year ago

    you are helping me a lot in my Master's degree in NLP. Thank you man ! Keep up the good work.

  • @YassineIabdounane

    a year ago

    thanks for the nice words man! best of luck with your Master's degree!

  • @naveedkhattak7775
    3 years ago

    For the first time, I have understood the things related to CL. Thank you

  • @YassineIabdounane

    3 years ago

    I'm very happy to hear that! All the best

  • @runnihuang1161
    4 years ago

    Thank you so much! Amazing course!

  • @YassineIabdounane

    4 years ago

    My pleasure! I'm glad you liked it!

  • @MOCCLIVE
    3 years ago

    It's awesome to learn about the different types of corpora.

  • @YassineIabdounane

    3 years ago

    Thank you for watching!

  • @saralassri964
    4 years ago

    I am so proud of you!

  • @YassineIabdounane

    4 years ago

    Thank you so much my dear!

  • @GhumroQadir
    2 years ago

    Very useful videos. I loved them.

  • @YassineIabdounane

    2 years ago

    Thanks man! Happy to know that :)

  • @bashairj3156
    3 years ago

    Thannnkk you so much! Thank you Yassine!

  • @YassineIabdounane

    3 years ago

    My pleasure!

  • @Lwahranya
    a year ago

    Thank you very much!

  • @sweetASrere
    4 years ago

    Great videos Yassine! Thank you

  • @YassineIabdounane

    4 years ago

    Thank you Reina! I'm glad you find them useful!

  • @Enjoy.your_life34.

    3 years ago

    @YassineIabdounane What's an example of a specialized corpus?

  • @YassineIabdounane

    3 years ago

    @Enjoy.your_life34. A specialized corpus includes texts of a particular type; an example would be the Michigan Corpus of Academic Spoken English (MICASE).

  • @quincyjones7951
    2 years ago

    excellent was very helpful - thanks!

  • @YassineIabdounane

    2 years ago

    my pleasure! Happy you find it helpful :)

  • @radzuwan85
    2 years ago

    Thanks!

  • @mairasabdrahman3861
    2 years ago

    Tq for the information

  • @shifaais3129
    3 years ago

    God bless you!

  • @YassineIabdounane

    3 years ago

    Thank you very much! God bless you too :)

  • @sheikhmuhammadnawaz1998
    3 years ago

    Informative

  • @YassineIabdounane

    3 years ago

    Thank you!

  • @FzFz-pn9gb
    7 months ago

    What are the types of register? And please explain register and genres.

  • @nomansaeed2076
    3 years ago

    good

  • @prodibsa769
    3 years ago

    bro, can you make a video on how to search binomial word pairs in a certain corpus, like COCA.

  • @YassineIabdounane

    3 years ago

    To look for binomials in COCA, simply use this expression: * _n* and * _n*
    That's about it bro :)
    PS: please delete the spaces between * and _ when you use the expression. I added them because a character between two * is printed in bold here in the comments, like this: *_n* and *_n*

  • @humairajabeen2573
    a year ago

    Aoa, sir how can I contact you for my PhD research in linguistics using corpus linguistics. thanks

  • @sabrinamalik2972
    4 years ago

    Can you please elaborate on statistical significance and significance tests with examples? And also type-token ratio, please...

  • @YassineIabdounane

    4 years ago

    On statistical significance and significance testing:
    Say that you have two corpora: one contains texts produced by men, and the other contains texts produced by women. You would like to see whether men use the word ‘wonderful’ more than women do. You compare the frequencies and find that men used the word 128 times while women used it only 110 times. So it seems that men do indeed use ‘wonderful’ more than women do. Nevertheless, there are a number of things to consider, corpus size for example!

    Here’s the question: is the observed difference large enough to claim that, in general, men use that word more than women do? Or is it just a matter of chance that has nothing to do with men’s and women’s speech? To determine whether the difference is statistically significant and not due to chance, we need to use significance tests. One example would be the chi-square test. The chi-square test compares the actual observed frequencies (128 and 110 in our case) with the expected frequencies (the ones that we would expect if no factor other than chance had been involved). The closer these two sets of frequencies are to each other, the greater the probability that the observed frequencies are influenced by chance alone, in which case the difference would not be significant.

    If you want to read more about it, I recommend this: www.lancaster.ac.uk/fss/courses/ling/corpus/Corpus3/3SIG.HTM
    Here’s more on expected frequencies and the chi-square test: kzread.info/dash/bejne/jIl7raioeLiugaw.html&t

    On type/token ratio:
    Type/token ratio is a measure of lexical richness. In essence, it gives you an idea of how many distinct words (types) are used in a text relative to the total number of words (tokens). It is calculated by dividing the total number of types by the total number of tokens. The closer the score is to 1, the richer the text (the more distinct words are used); the further it is from 1, the more repetition you have in the text.
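As a rough illustration of the two measures described above, here is a minimal Python sketch. The corpus sizes (one million tokens each) are hypothetical, since the comment gives only the raw counts 128 and 110. The function names `chi_square_2x2` and `type_token_ratio` and the simple regex tokenizer are likewise assumptions made for this example, not something taken from the video.

```python
import math
import re


def chi_square_2x2(freq_a, size_a, freq_b, size_b):
    """Pearson chi-square (no continuity correction) for one word in two corpora.

    Returns the chi-square statistic and its p-value for 1 degree of freedom.
    """
    # Observed 2x2 table: [occurrences of the word, all other tokens] per corpus.
    observed = [
        [freq_a, size_a - freq_a],
        [freq_b, size_b - freq_b],
    ]
    total = size_a + size_b
    row_totals = [size_a, size_b]
    col_totals = [freq_a + freq_b, total - (freq_a + freq_b)]

    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            # Expected count if only chance (and corpus size) were at work.
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (observed[i][j] - expected) ** 2 / expected

    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


def type_token_ratio(text):
    """Number of distinct word forms divided by the total number of words."""
    tokens = re.findall(r"[a-z']+", text.lower())  # deliberately naive tokenizer
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)


# Hypothetical corpus sizes: 1,000,000 tokens each (not given in the comment).
chi2, p = chi_square_2x2(128, 1_000_000, 110, 1_000_000)
print(f"chi-square = {chi2:.3f}, p = {p:.3f}")

print(type_token_ratio("the cat sat on the mat"))
```

With these assumed sizes the p-value comes out above the conventional 0.05 threshold, so the 128 vs. 110 difference would not count as significant; different corpus sizes could change that conclusion. Note also that type/token ratio tends to drop as texts get longer, so it is safest to compare texts of similar length.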

  • @sabrinamalik2972

    4 years ago

    Thank you so much.

  • @sittiesohaylagubat289
    3 years ago

    I have a question, sir: how would I apply corpus linguistics to this topic, the singularization of "they"? Hope you answer my question. Thank you.

  • @YassineIabdounane

    3 years ago

    It depends on what you want to study exactly. If you are interested in its historical development I would suggest using a historical corpus of English and see how the use of 'they' changes over the years.

  • @sittiesohaylagubat289

    3 years ago

    Thank you so much for responding to my concern, sir. I have a research study titled THE SINGULARIZATION "THEY" IN AN UNDERGRADUATE THESIS. In the matrix written in our methodology, we will use Corpus Instrument instead. So in your opinion, which corpus exactly should we use for our research? I'm not that familiar with corpora yet, and there are a lot of questions in my mind about them. Thank you again for responding.

  • @andreanicole6548
    2 years ago

    I love the winnie the pooh "repertoire" meme hehe

  • @YassineIabdounane

    2 years ago

    makes you feel so fancy doesn't it? lol

  • @sabrinamalik2972
    4 years ago

    Please explain Reference corpus.

  • @YassineIabdounane

    4 years ago

    Hi Sabrina! A reference corpus is a corpus that you choose as a standard of comparison with the corpus you're working with. It is usually more general and representative of the language as a whole, and it is large enough to represent all relevant varieties of that language and its features. Here's how it is useful. Say you are working with a corpus of biology texts, and you want to display a list of keywords that are particularly characteristic of the type of discourse or language contained within that biology corpus. In this case, you'd need to compare this 'specialized corpus' with a more general 'reference corpus' so as to see the list of words that are particular to 'biology'.
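The comparison described above can be sketched in Python. The comment does not say which statistic to use, so this example assumes log-likelihood, one widely used keyness measure. The two toy "corpora" and the function names `log_likelihood` and `keywords` are made up for illustration; a real study would use far larger texts and a proper tokenizer.

```python
import math
from collections import Counter


def log_likelihood(freq_target, size_target, freq_ref, size_ref):
    """Log-likelihood keyness score for one word across two corpora."""
    total_freq = freq_target + freq_ref
    total_size = size_target + size_ref
    # Expected frequencies if the word were spread evenly over both corpora.
    expected_target = size_target * total_freq / total_size
    expected_ref = size_ref * total_freq / total_size

    score = 0.0
    if freq_target > 0:
        score += freq_target * math.log(freq_target / expected_target)
    if freq_ref > 0:
        score += freq_ref * math.log(freq_ref / expected_ref)
    return 2 * score


def keywords(target_tokens, ref_tokens, top_n=5):
    """Words relatively more frequent in the target corpus, ranked by keyness."""
    target_counts = Counter(target_tokens)
    ref_counts = Counter(ref_tokens)
    size_target, size_ref = len(target_tokens), len(ref_tokens)

    scored = []
    for word, freq in target_counts.items():
        ref_freq = ref_counts[word]
        # Keep only words that are overused in the target relative to the reference.
        if freq / size_target > ref_freq / size_ref:
            score = log_likelihood(freq, size_target, ref_freq, size_ref)
            scored.append((word, score))

    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


# Toy data, invented for this example: a "specialized" and a "reference" corpus.
biology = "the cell divides and the cell membrane protects the cell".split()
general = "the dog ran and the cat sat and the bird sang in the garden".split()

for word, score in keywords(biology, general):
    print(f"{word}: {score:.2f}")
```

Words such as "the", which are about equally common in both corpora, score close to zero, while words concentrated in the specialized corpus rise to the top of the list.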

  • @sabrinamalik2972

    4 years ago

    Excellent. Thank you so much.

  • @sabrinamalik2972

    4 years ago

    Are a reference corpus and a monitor corpus the same or different? When I searched for examples, the Bank of English was given as an example of both.

  • @YassineIabdounane

    4 years ago

    Not all monitor corpora can be used as reference. A monitor corpus is one which grows in size over time. Still, the data that makes the corpus may not be general enough for the corpus to be used as reference. For instance, a monitor corpus of newspapers' data is certainly not a general corpus, or one to be viewed as 'a standard' for comparison.

  • @kollisoraya2938
    2 years ago

    😭

  • @dieths7776
    3 years ago

    Can i ask? What is the purpose of corpus?

  • @YassineIabdounane

    3 years ago

    A corpus is intended to be a representative sample of authentic language use. There are various types of corpora as you can see so specific research purposes would vary depending on the type of the corpus chosen. But the general aim I would say is to study how a language is used authentically in a given context (either generally, or across different regions, time periods, domains etc...)

  • @jaca2899
    2 years ago

    you look like Snowden

  • @YassineIabdounane

    2 years ago

    haha it's the glasses I think
