#2 Introduction to Corpus Linguistics - Types of Corpora
Ғылым және технология
Hello there! In this video I present to you some of the most common types of corpora. This is by no means an exhaustive list but it's enough to get you started :)
If you want to use any of my videos, kindly let me know :)
Corpus annotation: Use TagAnt to tag your corpus for Part of Speech. (I'll talk about this in detail in future videos)
Download TagAnt: www.laurenceanthony.net/softw...
View Part of Speech Tags online: courses.washington.edu/hypert...
Contact me at: yassine.iabdounane@gmail.com
Пікірлер: 59
Sample Copora 02:25 Corpora for Comparison 05:13 General Corpora 09:50 Specialized Corpora 10:58 Annotated Corpora 11:35 Unannotated Copora 17:11 Learner Corpora 17:50
@ghmarioumaima5391
4 жыл бұрын
Hello! I want to email u but i couldn't find ur email. Would u please Write it for me and thanks.
@YassineIabdounane
4 жыл бұрын
@@ghmarioumaima5391 Hi Oumaima! Sorry about that. There you go: yassine.iabdounane@gmail.com
I am new to the field of Corpus Linguistics. I am learning too many things from your videos. Thank you for sharing such informative videos.
God bless you sir I'm so grateful for learning this gorgeous lesson Thank you so much 🥰❤️
Thank you for giving us insightful and organized lessons about the corpus linguistics!
@0101799
Ай бұрын
Also, it was very cute of you showing the "Helsinki" in the la Casa de Papel!!!!
Thank you so much, clearly explained.. i'm doing my master's degree in spain and curpus lingsuistics is a new concept to me.
Plz sir keep sharing your knowledge with us ❤
you are helping me a lot in my Master's degree in NLP. Thank you man ! Keep up the good work.
@YassineIabdounane
Жыл бұрын
thanks for the nice words man! best of luck with your Master's degree!
First the first time, i have understood the things related to CL. Thank yoi
@YassineIabdounane
3 жыл бұрын
I'm very happy to hear that! All the best
thank you so much!amazing course!
@YassineIabdounane
4 жыл бұрын
My pleasure! I'm glad you liked it!
It's awesome to learn different typer of Corpora.
@YassineIabdounane
3 жыл бұрын
Thank you for watching!
I am so proud of you!
@YassineIabdounane
4 жыл бұрын
Thank you so much my dear!
Very useful videos. I loved them.
@YassineIabdounane
2 жыл бұрын
Thanks man! Happy to know that :)
Thannnkk you so much! Thank you Yassine!
@YassineIabdounane
3 жыл бұрын
My pleasure!
Merci beaucoup !
Great videos Yassine! Thank you
@YassineIabdounane
4 жыл бұрын
Thank you Reina! I'm glad you find them useful!
@Enjoy.your_life34.
3 жыл бұрын
@@YassineIabdounane whats the example of Specialized corpora
@YassineIabdounane
3 жыл бұрын
@@Enjoy.your_life34. a specialized corpus includes texts of a particular type, an example would be the Michigan Corpus of Academic Spoken English (MICASE)
excellent was very helpful - thanks!
@YassineIabdounane
2 жыл бұрын
my pleasure! Happy you find it helpful :)
Thanks!
Tq for the information
God bless you!
@YassineIabdounane
3 жыл бұрын
Thank you very much! God bless you too :)
Informative
@YassineIabdounane
3 жыл бұрын
Thank you!
What are the type of registre. And please explain registre and geres
good
bro, can you make a video on how to search binomial word pairs in a certain corpus, like COCA.
@YassineIabdounane
3 жыл бұрын
To look for binomial in COCA simply use this expression: * _n* and * _n* That's about it bro :) PS: please delete the spaces between * and _ when you use the expression. I added them because a character between two * is printed in bold here in the comments like this *_n* and *_n*
Aoa, sir how can I contact you for my PhD research in linguistics using corpus linguistics. thanks
Can you please elaborate statistical significance and significance test with examples? And also type-token ratio Please...
@YassineIabdounane
4 жыл бұрын
On statistical significance and significance testing: Say that you have two corpora, one contains texts produced by men, and the other contains texts produced by women. You would like to see whether men use the word ‘wonderful’ more than women do. You compare the frequencies and you get that men have used the word 128 times while women have used it 110 times only. So, it seems that indeed men use ‘wonderful’ more than women do. Nevertheless, there is a number of things to consider, corpus size for example! Here’s the question, is the observed difference actually significant to claim that in general men use that word more than women do? or is it just a matter of chance and has nothing to do with men and women’s speech? To determine whether the difference is statistically significant and not due to chance, we need to use significance tests. One example would be the chi-square test. What the chi-square test does is that it compares the difference between the actual observed frequencies (128 and 110 in our case), with the expected frequencies ( the ones that we would expect if no factor other than chance had been involved). The closer these two results are to each other, the greater the probability that the observed frequencies are influenced by chance alone, hence the difference would not be significant. If you want to read more about it, I recommend this: www.lancaster.ac.uk/fss/courses/ling/corpus/Corpus3/3SIG.HTM Here’s more on expected frequencies and the chi-square test: kzread.info/dash/bejne/jIl7raioeLiugaw.html&t On type/token ratio: Type/token ration is a measure of lexical richness. In essence it gives you an idea about how many distinct words (types) are used in a text relative to the total number of words (tokens). It is calculated by dividing the total number of types by the total number of tokens. The closer the score is to 1, the richer the text (the more distinct words are used), the further it is from 1, the more repetitions you have in the text.
@sabrinamalik2972
4 жыл бұрын
Thank you so much.
I have a question sir, how will i use corpus linguistics to this topic Singularization of "they" ? Hope you answer my question..thank you.
@YassineIabdounane
3 жыл бұрын
It depends on what you want to study exactly. If you are interested in its historical development I would suggest using a historical corpus of English and see how the use of 'they' changes over the years.
@sittiesohaylagubat289
3 жыл бұрын
Thank you so much for responding my concern sir. I have a study research which in title of THE SINGULARIZATION "THEY" IN AN UNDERGRADUATE THESIS. In our matrix written in methodology. We will use Corpus Instrument instead. So in your own opinion, what exactly corpus were gonna use for our reaserch? Because, i'm not that familiar of corpus yet. There's a lot of questions in my mind about corpus. Thank you for responding again.
I love the winnie the pooh "repertoire" meme hehe
@YassineIabdounane
2 жыл бұрын
makes you feel so fancy doesn't it? lol
Please explain Reference corpus.
@YassineIabdounane
4 жыл бұрын
Hi Sabrina! A reference corpus is a corpus that you choose as a standard of comparison with the corpus you're working with. It is usually more general and representative of the source language as a whole and it is large enough to represent all relevant varieties of a language and its features. Here's how it is useful. Say you are working with a corpus of biology, and you want to display a list of keywords that are particularly characteristics of the type of discourse or language contained within that biology corpus. In this case, you'd need to compare this 'specialized corpus' with a more general 'reference corpus' so as to see the list of words that are particular to 'biology'.
@sabrinamalik2972
4 жыл бұрын
Excellent. Thank you so much.
@sabrinamalik2972
4 жыл бұрын
Refernce corpus and monitor corpus are same or different? Because when I searched examples Bank of English Is used as example for both corpora.
@YassineIabdounane
4 жыл бұрын
Not all monitor corpora can be used as reference. A monitor corpus is one which grows in size over time. Still, the data that makes the corpus may not be general enough for the corpus to be used as reference. For instance, a monitor corpus of newspapers' data is certainly not a general corpus, or one to be viewed as 'a standard' for comparison.
😭
Can i ask? What is the purpose of corpus?
@YassineIabdounane
3 жыл бұрын
A corpus is intended to be a representative sample of authentic language use. There are various types of corpora as you can see so specific research purposes would vary depending on the type of the corpus chosen. But the general aim I would say is to study how a language is used authentically in a given context (either generally, or across different regions, time periods, domains etc...)
you look like Snowden
@YassineIabdounane
2 жыл бұрын
haha it's the glasses I think