Dataflow Gen2 in Microsoft Fabric - ULTIMATE GUIDE (Part 1 of 2)

Ғылым және технология

Dataflows are a key part of the Data Factory experience in Microsoft Fabric. With dataflows we can bring data into Fabric, transform it and load it into our OneLake.
This is part 1 of a 2-part guide to dataflows in Microsoft Fabric.
The 2nd video about dataflows features a real-life project using Dataflows - make sure you're subscribed for that one coming very soon!
Link to the Fabric.Guru article: fabric.guru/fabric-not-all-de...
0:00 Intro
0:39 Key Concepts
3:21 Get Data into Dataflow
5:55 Transforming Data
7:25 Outputting data to destination
8:00 Who is it for?
09:34 Comparison with other tools
12:32 Limitations
14:11 Wrapup

Пікірлер: 26

  • @pphong
    @pphong9 ай бұрын

    Thank you for the detailed presentation on Data Factory, especially the comparison between DFg2 and Data Pipeline.

  • @LearnMicrosoftFabric

    @LearnMicrosoftFabric

    9 ай бұрын

    No problem, glad you found it useful ☺️

  • @vt1454
    @vt14548 ай бұрын

    Great overview !!

  • @LearnMicrosoftFabric

    @LearnMicrosoftFabric

    8 ай бұрын

    Thanks for watching! :)

  • @karybuilds1611
    @karybuilds1611Ай бұрын

    Hey, loving this channel! where is the data set so i could follow along?

  • @LearnMicrosoftFabric

    @LearnMicrosoftFabric

    Ай бұрын

    Hey thanks, glad you're enjoying! Sorry but I actually can't remember, but it looks like an ourworldindata dataset!

  • @AmritaOSullivan
    @AmritaOSullivan8 ай бұрын

    What is the compute for DF gen 2? As I. Why is it slower than let’s say a fabric pipeline? Don’t they both use the MPP architecture? Thanks

  • @LearnMicrosoftFabric

    @LearnMicrosoftFabric

    8 ай бұрын

    Not entirely sure of the reason why it is slower tbh, and I haven't actually checked since posting this video, I could have already improved. I expect by the time Fabric reaches GA it will on a par with pipelines

  • @nishantkumar9570
    @nishantkumar95703 ай бұрын

    Which one would faster and better in term of compute cost as time saving, dataflow gen2 or running notebook?

  • @LearnMicrosoftFabric

    @LearnMicrosoftFabric

    3 ай бұрын

    Notebook I believe! But I am planning to do a side-by-side comparison of capacity usage for each of them soon 🙌

  • @user-vo8en9zh1i
    @user-vo8en9zh1i3 ай бұрын

    In terms of comparison to other tools, what about Azure Data Factory, particularly Mapping Data Flows? Mapping Data Flows seem to be missing from Fabric

  • @LearnMicrosoftFabric

    @LearnMicrosoftFabric

    3 ай бұрын

    You’re right, there’s no mapping dataflows in the Fabric Data Pipeline. in the documentation pages it says that these have been ‘replaced’ by Dataflow Gen2. I did a more recent video on data pipeline vs dataflows here and mentioned mapping dataflows kzread.info/dash/bejne/pmmht62afLrWeKg.html

  • @nies_diy986
    @nies_diy9868 ай бұрын

    Great tutorial , how we can transform multiple csv in a dataflow and then sink them in a lakehouse or warehouse using the same dataflow is it even possible i tried it but when i select the destination it is only taking one CSV to sink at destination i am stuck here

  • @LearnMicrosoftFabric

    @LearnMicrosoftFabric

    8 ай бұрын

    Yes that's possible! Although how you do it depends on where the CSVs are stored. If in Lakehouse Files area, use the Folder source in Dataflow Gen2. If in SharePoint, use the SharePoint Folder source. And if in Azure Blob or ADLS, you can use the blob connecter or ADLS connector respectively

  • @nies_diy986

    @nies_diy986

    8 ай бұрын

    @@LearnMicrosoftFabric Thanks a lot , i successfully moved files from Lakhouse to mywarehouse using dataflow however when i ran dataflow twice or thrice using append its duplicating and triplicating the records for example i have Department table ( CSV ) and then i appended using DF initially Dept table was having 3 records ID 1 DP1 , ID 2 DP 2 and ID3 DP3 now all of these 3 records are triplicated in my Data is there any configuration to stop this from happening i used the same CSV ( nothing changed ) and ran the datflow 3 times , my expectation was that Fabric will not append any record as the ID already exists in my current CSV ( nothing being changed in the CSV ) will appreciate your assistance

  • @AndyKabeer
    @AndyKabeer9 ай бұрын

    Hi, great video, how do we load data from REST API using Dataflow Gen2 ?

  • @LearnMicrosoftFabric

    @LearnMicrosoftFabric

    9 ай бұрын

    Hey, thanks! you can do it PowerQuery using Web.Contents()

  • @AndyKabeer

    @AndyKabeer

    9 ай бұрын

    thanks, thats what I am using (copy-pasting power query m-code from desktop to blank query in dataflow gen2 ) just if dataflow gen 2 had the same REST API connector as data pipeline

  • @LearnMicrosoftFabric

    @LearnMicrosoftFabric

    9 ай бұрын

    Yes it also has a UI based connector called Web API - its under the ‘Online’ section of the Connectors when you do Get Data ☺️

  • @cargouvu
    @cargouvu2 ай бұрын

    How did you get to the PowerQuery interface? It's sort of understood but I'm having trouble getting to it.

  • @LearnMicrosoftFabric

    @LearnMicrosoftFabric

    2 ай бұрын

    Ah sorry yes I should have shown that! Go to app.fabric.Microsoft.com Click on data factory Click New dataflow You should now be in the dataflow powerquery editor 👍

  • @EduInquisitive
    @EduInquisitive26 күн бұрын

    Is it possible to get data into fabric from multiple sources? if yes how can we do that?

  • @LearnMicrosoftFabric

    @LearnMicrosoftFabric

    26 күн бұрын

    with a dataflow? yes, just make multiple queries. Or you can split them out into multiple dataflows

  • @EduInquisitive

    @EduInquisitive

    26 күн бұрын

    @@LearnMicrosoftFabric Thank you

  • @PrafulThube
    @PrafulThube2 ай бұрын

    Where is part 2 of 2

  • @LearnMicrosoftFabric

    @LearnMicrosoftFabric

    2 ай бұрын

    Dataflows end-to-end project (Microsoft Fabric) + Lakehouse + Power BI kzread.info/dash/bejne/jKGbvLKhnJWygrg.html

Келесі