5 genomics file formats you must know

Ғылым және технология

FASTA, FASTQ, BAM, VCF, & BED on the command line.
Also see my video on command-line basics: Introduction to bash for data analysis: • Introduction to bash f... .
Get samtools: www.htslib.org/download/
Get bedtools: bedtools.readthedocs.io/en/la...
Good blog post on CRAM nuances: www.ga4gh.org/news/guest-post...

Пікірлер: 42

  • @GenomicsBootCamp
    @GenomicsBootCamp2 жыл бұрын

    Thanks for the VERY informative video! One follow-up on the "bed" files. In analyses related to SNP data using PLINK, the .bed files stand for "binary ped files", and hold genotypes potentially for the entire genome. They also do not stand on their own, but are coupled with .fam file that holds info on individuals, and .bim files that hold info on the chromosome and position of the SNP.

  • @suzannelong8090
    @suzannelong80902 жыл бұрын

    This was extremely helpful and interesting, thank you

  • @meghasailwal2554
    @meghasailwal25542 жыл бұрын

    You make the learning very easy. Thank you for making such interesting videos.

  • @JoseCastillo-wl4kp
    @JoseCastillo-wl4kp Жыл бұрын

    Excellent video. Very useful and clear. Congrats.

  • @shakedshanas1
    @shakedshanas12 жыл бұрын

    Great video, very informative and helpful when starting to use those files. I think every person who is mapping for the first time should absolutely watch that video to have a primary understanding about the files. I saw this video a few months ago and saw this again today, just for having a better understanding of the potential and using the command line to visualize the data. Thank you so much!

  • @abstractnonsense8344

    @abstractnonsense8344

    2 ай бұрын

    Yeah, I agree. I am just getting into this stuff and I found this content a great intro.

  • @dariushghasemi6476
    @dariushghasemi64762 жыл бұрын

    Extremely useful video! I really need your explanation to elucidate my nodding knowledge about various file formats. Many thanks! Keep producing more videos, PLEASE! :))

  • @austinleefers369
    @austinleefers3697 ай бұрын

    This is so good. Honestly, more useful than my whole grad school bioinfo course.

  • @danielromero-alvarez5392
    @danielromero-alvarez53922 жыл бұрын

    FANTASTIC VIDEO! thank you very much, I am just starting with this and nobody has taught me this so clearly! :)

  • @DocLithium
    @DocLithium2 жыл бұрын

    Hey! LOVE to see that you’re making videos more frequently. There might be less views on this one but keep going, you have absolutely great quality content and a great background to choose content from. Make videos about you PhD, your college and your work too about what you studied, what you do at work everyday and such. It’s a bit optimistic, but hoping to get to CSHL one day myself! PS: Make the video thumbnails more clickbait-y and graphically designed lol

  • @jasondotgen8267
    @jasondotgen82672 жыл бұрын

    Looking forward to that video on variant calls 😄

  • @xiapeter5618
    @xiapeter5618 Жыл бұрын

    This is a great introduction!

  • @edossamerga4814
    @edossamerga4814 Жыл бұрын

    Thank you for contribution in genomics I started to follow you on

  • @JoseCastillo-wx6jd
    @JoseCastillo-wx6jd Жыл бұрын

    Excellent video, thank you.

  • @subhaleenasarkar509
    @subhaleenasarkar5092 жыл бұрын

    Thank you ..it's so much helpful

  • @RenanSantos-px9ml
    @RenanSantos-px9ml2 жыл бұрын

    Very, very nice video!

  • @fenglei
    @fenglei Жыл бұрын

    Thanks for sharing this info.

  • @NatarajanGanesan
    @NatarajanGanesan Жыл бұрын

    Great video.

  • @kankit08
    @kankit082 жыл бұрын

    Thankyou for the knowledge sharing

  • @patricioperez1985
    @patricioperez19852 жыл бұрын

    Like it, love it, useful and fun.

  • @dariushghasemi6476
    @dariushghasemi64762 жыл бұрын

    Maria, please, you make many students like me cheerful if you make some videos or instructions about how do run GWAS, how to draw LocusZoom lots, how to compute Linkage Disequilibrium, or performing fine-mapping technique! I couldn't find any resources or tutorials yet neither on KZread nor in our institute through online courses!

  • @benysmart1643
    @benysmart1643 Жыл бұрын

    Very helpful, thanks

  • @PeihuiBrandonYeo
    @PeihuiBrandonYeo2 жыл бұрын

    This is great! thanks

  • @vincentweomd
    @vincentweomd2 жыл бұрын

    Thanks for the informative video. I'm new on this informatics but I'm planning to sequence more than 50.000 human WGS.

  • @sujitsilas6552
    @sujitsilas65522 жыл бұрын

    Mapping and aligning are slightly different concepts not to be confused with. But great video!

  • @nabildhifallah6964
    @nabildhifallah6964 Жыл бұрын

    bash is also important cause to data analysis thank you

  • @navinray
    @navinray2 жыл бұрын

    Thank you!

  • @petrosstyle2981
    @petrosstyle29812 жыл бұрын

    Maria which is in your opinion the best book in bioinformatics? which bioinformatics book did you really enjoy reading?

  • @ChathuraRanasingheOfficial
    @ChathuraRanasingheOfficial2 жыл бұрын

    1st comment, it's happy to see the video

  • @praveenrathore315
    @praveenrathore3152 жыл бұрын

    Very nice

  • @patricklogan6089
    @patricklogan60892 жыл бұрын

    Thank you

  • @betteniacole993
    @betteniacole9932 жыл бұрын

    Do you know what to annotate a sam file? This was a question in my bioinformatics class. usually I see bed files annotated instead. We are annotating from sam with fed features file

  • @PennytheBALLstar13
    @PennytheBALLstar132 жыл бұрын

    Are there any entry level tech jobs that you could recommend for a college student that could help you learn some of the necessary skills?

  • @genomicsandbioinformatics9628
    @genomicsandbioinformatics96282 жыл бұрын

    Great explanation, would you explain how ref and alt alleles are assigned in a vcf file. Is it assigned on the basis of allele frequency? As in a larger population there may be different types of snps such as A, C, T, G, then how only one snp is assigned as Alt allele? Is it assigned on the basis of its frequency in the population? E.g In different individuals of a population, there may be many possible snps at a specific position such as A, T, C, G. So who can we know that which snp could be the Alt allele?

  • @OMGenomics

    @OMGenomics

    2 жыл бұрын

    There can be multiple alt alleles at some positions in the genome. There isn’t one allele that is called the “alt”, in fact all of them that aren’t “ref” are “alt” alleles. The VCF simply includes all the alt alleles observed in the sample (or samples) at each position.

  • @genomicsandbioinformatics9628

    @genomicsandbioinformatics9628

    2 жыл бұрын

    OMGenomics many thanks for your quick answer. I think you didn’t get my point. I am asking about the REF and ALT allele columns in a vcf file. How Alt alleles are assigned in Alt allele Column? In vcf files, I have seen only one allele in the Alt allele column at a specific position. I am not talking about the samples. I just want to know how Alt allele are assigned in the Alt allele Column? Thanks in advance.

  • @OMGenomics

    @OMGenomics

    2 жыл бұрын

    Many positions only have one alt that has been observed, so that’s the one listed in the ALT column. But if you look around a VCF you’ll find rows with multiple alleles listed in the ALT column.

  • @partha_plethorapedia
    @partha_plethorapedia Жыл бұрын

    How to open FA file?

  • @Its_InduB
    @Its_InduB2 жыл бұрын

    Hi. Is this video linked with others as I didn't catched it. Also I am postgraduate student, working on crispr project. Can you please provide your email if possible. I have some query regarding my project. Thanks.

  • @albo8477
    @albo84775 ай бұрын

    U weet niet alleem genetica, maar ook het UNIX/LINUX commandlijn, meisje!🙂🙂👍 Dit is raar in onze dagen.

  • @esraaelsaeed1765
    @esraaelsaeed17652 жыл бұрын

    Can i contact with you by email I am seeking you advice

  • @jonasan478
    @jonasan4782 жыл бұрын

    4th viewer !! @u@

Келесі