AI Data Center Tour A Hardware Playground

Ғылым және технология

We get to tour a data center designed to let developers try out the latest AI hardware (and even some pre-release bits.) The Intel Developer Cloud has not just the company's Xeon CPUs, but also Xeon Max, Intel GPU Flex series, Intel GPU Max series (Ponte Vecchio), and the Habana Intel Gaudi / Gaudi 2 accelerators. If you ever wanted to see the hardware behind the Intel Developer Cloud, this is the tour.
STH Main Site Article: www.servethehome.com/touring-...
STH Top 5 Weekly Newsletter: eepurl.com/dryM09
----------------------------------------------------------------------
Become a STH YT Member and Support Us
----------------------------------------------------------------------
Join STH KZread membership to support the channel: / @servethehomevideo
STH Merch on Spring: the-sth-merch-shop.myteesprin...
----------------------------------------------------------------------
Where to Find STH
----------------------------------------------------------------------
STH Forums: forums.servethehome.com
Follow on Twitter: / servethehome
Follow on LinkedIn: / servethehome-com
Follow on Facebook: / servethehome
Follow on Instagram: / servethehome
----------------------------------------------------------------------
Other STH Content Mentioned in this Video
----------------------------------------------------------------------
- Intel Xeon Max: • This Server CPU is so ...
- PhoenixNAP Data Center Tour:
- Installing my own cloud server:
- Big AI servers need liquid cooling: • BIG AI Servers NEED This
----------------------------------------------------------------------
Timestamps
----------------------------------------------------------------------
00:00 Introduction
01:29 The Intel Developer Cloud
06:15 Touring Intel's Secret Data Center Development Center
07:15 Intel Xeon Max
07:50 Intel Data Center GPU Flex Series ATS-M
08:24 Habana Intel Gaudi 2 AI Accelerator
10:08 Big Power
10:32 AI Expansion in the Developer Cloud
11:01 Intel Data Center GPU Max 1100 Ponte Vecchio PCIe
11:38 Supermicro NVMe Storage Cluster
12:01 Networking 100GbE, 400GbE, and Firewalls
13:27 Global Intel Developer Cloud
13:51 Key Lessons Learned

Пікірлер: 96

  • @jolness1
    @jolness17 ай бұрын

    I love that you guys are so upfront about presenting anything that could remotely come close to “sponsored content”. Makes me have even more trust in STH. I wouldn’t blame you for not mentioning it because of course you need intel’s permission and maybe they paid for flights or whatever. But right away you guys mention it just so there is no ambiguity. I know that it’s still editorially independent though which is great. Appreciate the work you all do and Patrick’s infectious enthusiasm

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Always like to explain what is happening.

  • @LaweHawre-lr3ju

    @LaweHawre-lr3ju

    5 ай бұрын

    رئر

  • @jonathanbuzzard1376
    @jonathanbuzzard13767 ай бұрын

    5kW a server is peasy. The problem is with the 10kW H100 based DGX servers. The coling is not the problem either, it is getting the power into the rack

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    We are going to have a demo on the STH main site from OCP Summit 2023 where they are now liquid cooling the power delivery to help power AI servers

  • @jonathanbuzzard1376

    @jonathanbuzzard1376

    7 ай бұрын

    @@ServeTheHomeVideo I don't think you get it. What follows is our *real* experience of trying get ready to handle these servers, some of which are already onsite. We currently can only accommodate a single H100 DGX box in a rack and the rest of the rack is more or less empty with some test boxes we don't care about. The reason is that if even a single PSU fails on a H100 DGX box it will cause a cascade failure of the rack as all the breakers trip. First off liquid cooling is at this juncture a waste of time. So let's look at the Nvidia H100 DGX server. It's 5U in height and 10.8kW power draw using six C19 leads. I can therefore squeeze a maximum of 8 of these in a 42U rack. That would require ~87kW of cooling. The 12 year old water cooled rear doors on my racks are good for 93kW of cooling so I am already good to go with my cooling and have been for over a decade. The problem is the power. So I upgrade the power delivery to the racks (this is underway) to get three 32A 3-phase supplies per rack. Being in Europe that gives me ~66kW of electrical power in the rack. This is going to be worse in North America with your anaemic electrical system. Immediately I now have to drop the number of servers per rack down to six because I can't power them. Not great but it is acceptable. However, it gets worse as I need six C19 outlets per server. The *vast* majority of zero U PDU's only have six C19 outlets so now I am down to three servers per rack. I did find one with 12 C19 outlets per PDU but on further investigation, it turns out some tuna melt paired the C19 outlets up and put each pair behind a 20A breaker so less useful than a chocolate tea pot which at least you can eat. Now I am down to getting custom PDU's made but they are dumb and don't come with power monitoring. It all really really sucks. If you drop back to an eight way A100 box at 4U from a tier one vendor then you can get 10 of them in a single rack no problems. The 66kW of power is sufficient and being in Europe with 230/240V AC I can use C13 to C20 leads without changing the laws of physics as happens in North America. So suitable PDU's are easy to come by. Note the C13/C14 connector is as designed rated for a maximum of 10A. Due to the anaemic electrical system in North America you have decided the laws of physics don't apply and it's magically safe to use with a 15A load! It's no wonder the rate of electrical fires is much higher in North America than in Europe.

  • @charlesclampitt5067

    @charlesclampitt5067

    7 ай бұрын

    The power draw for the power you get from the gaudi2 shows how efficient they are.

  • @JazekFTW
    @JazekFTW7 ай бұрын

    In any case as a hardware enthusiat this video was amazing and those 4U server... WOW

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    We have been reviewing the 8-10 GPU systems since like 2016-2017. They are always awesome.

  • @joseaguilera-lizano2461
    @joseaguilera-lizano24617 ай бұрын

    Pretty cool, thanks for the video!

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Glad you enjoyed it. Have a great weekend.

  • @jerrytabeling
    @jerrytabeling7 ай бұрын

    Looks like free hardware tier noted in video is no longer available. Least expensive one I see is $0.45/hour for Small VM - Intel® Xeon® 4th Gen Scalable processor $0.45 / hour 8 cores, 16 GB memory, 20 GB disk

  • @scottmorgan5164
    @scottmorgan51647 ай бұрын

    Just a Sony FX3 chilling on the table like it ain't no thang. 😆 Cool video.

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Canon R5C, C70 and Sony A1, FX30, and behind me a FX6 as well. This was one of the last videos shot before we fully took down the studio in Austin to move to Scottsdale so cameras were all on that shelf.

  • @hotscott6619
    @hotscott66197 ай бұрын

    great show when they swap that old stuff out id like to be at the swap meet

  • @--JYM-Rescuing-SS-Minnow

    @--JYM-Rescuing-SS-Minnow

    7 ай бұрын

    🍿

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Ha! I wish that was an option.

  • @abosameh-tx7zl

    @abosameh-tx7zl

    7 ай бұрын

    چ

  • @ShdRuf

    @ShdRuf

    6 ай бұрын

    ​@@abosameh-tx7zl😂111e😊

  • @kelownatechkid
    @kelownatechkid7 ай бұрын

    Nice, thank you to Intel and Patrick for showing us this

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    This was a super fun video to do!

  • @Quamsi
    @Quamsi7 ай бұрын

    I think you might have broken the registration page, every time I click subscribe for the free tier I get an HTML only page that says "Request forbidden. Have a nice day!"

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    It happens sometimes.

  • @--JYM-Rescuing-SS-Minnow

    @--JYM-Rescuing-SS-Minnow

    7 ай бұрын

    🍿

  • @autarchprinceps
    @autarchprinceps7 ай бұрын

    Too bad Intel's AI/GPGPU products have come out close to a generation too late to be close to competitive. Nvidia certainly could use some competition to bring down pricing. They pretty much want your first born at this point, since they are basically selling anything they can produce no matter the price. But if you make back the entire cost in energy savings from the better efficiency, it will still be worth it for customers.

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Word on the street is that Gaudi 2 is actually very competitive on a price/ performance perspective. Also, we covered this yesterday on the STH main site, but for only a few thousand GPUs (~$50M or less) NVIDIA is now pushing the L40S which is much lower power/ performance. Hopefully one day we can do Ponte Vecchio vs. L40S www.servethehome.com/nvidia-l40s-is-the-nvidia-h100-ai-alternative-with-a-big-benefit-supermicro/

  • @kimcharles7382
    @kimcharles73827 ай бұрын

    Just wondering what Camera vendor/model are u using? 😊

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    On this trip, in my hand is the Canon R5. I think we also had a Sony FX3 shooting some B-roll. Normal A-cam on the channel is the Canon C70.

  • @bits2646
    @bits26467 ай бұрын

    Nice, I'm a potential customer and I look for 10tTOPS AI machine, can't wait to go play with 10mln hardware :DDD

  • @JasonsLabVideos
    @JasonsLabVideos7 ай бұрын

    Nice work Patrick !! this is awesome !!

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Thanks! It was super fun to do

  • @Darkk6969
    @Darkk69697 ай бұрын

    All of that AI hardware started SkyNet! 🤣 On serious note so friggin cool you were able to visit that data center. Can't tell you how jealous I am!

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Thanks! It was super fun to see another part of it

  • @thomasoreilly6140
    @thomasoreilly61407 ай бұрын

    Looks pretty cool

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Very cool indeed.

  • @greenvy2000
    @greenvy20007 ай бұрын

    Seems like the space has a great amount of heat capacity with the high ceiling. I wonder why have a high ceiling height versus a ceiling height of 8' or 9'.

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Not bad in the Oregon climate.

  • @--JYM-Rescuing-SS-Minnow
    @--JYM-Rescuing-SS-Minnow7 ай бұрын

    wow! look's like a fun day @ the park! nice stuff Patrick! yup! ''never been a better time 2 use the Cloud''! it's 2 bad Wendell couldn't see all this!🍿🎉 yup! this is the great thing about ''quick pay plans'' 4 a server! goes beyond convenient, and fast! good luck!

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    I think Wendell has been to the other part that we showed in this video since he was at that event with me.

  • @AlexSchendel
    @AlexSchendel7 ай бұрын

    As a firmware engineer at Jones Farm, I hope I'll be able to catch you next time you're at JF :D

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Ha! People have certainly seen me going out of the cafeteria and elsewhere and said hi. Please do come say hi if you see me

  • @velo1337
    @velo13377 ай бұрын

    do they have intel xeon max 9480 in the Dev Cloud?

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    I know there is Xeon Max in the Dev Cloud

  • @tmcarter3
    @tmcarter37 ай бұрын

    Always great and informative videos... Job well done!

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Much thanks! Have a great day.

  • @thepastrecedes1635
    @thepastrecedes16357 ай бұрын

    Are you gonna be at SC next week? I kept seeing you last year but never had a chance to say hi

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Yes I will be. I think we are going to have a few folks there

  • @sumseq

    @sumseq

    7 ай бұрын

    Hope to see you there!@@ServeTheHomeVideo

  • @wojtek1180
    @wojtek11807 ай бұрын

    Hi, i want to make my own home lab for AI, im wondering how hard it is, and where to start, I want to run llms I'm wondering, if there is better option than nvidia GPU, especially, since one gpu wouldn't have enough VRAM as i need, is there a way to go around that? or because with code llama llm's they just won't start if you dont have enough VRAM, is there any workaround this?

  • @jonathanbuzzard1376

    @jonathanbuzzard1376

    7 ай бұрын

    Give up is the short answer. If you can't get by with the RAM on a single GPU then it is not possible to do under tens of thousands of dollars at the moment.

  • @AlexSchendel
    @AlexSchendel7 ай бұрын

    I recall during the heat wave in Oregon a couple summers ago, we all got emails to shut off servers we didn't immediately need in Jones Farm because the AC couldn't keep up haha. Thankfully a lot of steps have been taken to prevent those extreme measures from needing to be taken again haha.

  • @johnmijo

    @johnmijo

    7 ай бұрын

    Very nice, nothing like having a day of 108 and then 112 and then finally that 116 temp :p I was thinking that PDX got renamed to Phoenix somehow....

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    I had a room there during a 100F+ day and the AC was struggling

  • @chengcao418
    @chengcao4187 ай бұрын

    what's that AC plug? It looks like an J1772 car charger lol

  • @krijo210
    @krijo2107 ай бұрын

    Harley Davidson. 65 hp for making noise. 5 hp for pushing you forward. Intel. 4950 W for heating the air. 50 W for doing useful work. Arm. :)

  • @redtails
    @redtails7 ай бұрын

    kinda sad if you consider the majority of those "ai workloads" might be analyses geared towards selling more junk to customers, i.e. advertisements, customer behavior analysis, stuff like that

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    AI is going to be used for almost everything, so not much we can do at this point.

  • @hubstrangers3450
    @hubstrangers34507 ай бұрын

    Thank you....please could demo a K8 cluster setup on that intel cloud platform, if time permits.....thank you again..

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Cool idea. We actually did a $1 Proxmox VE cluster video at PhoenixNAP last year.

  • @GROW_YT_VIEWS_002
    @GROW_YT_VIEWS_0027 ай бұрын

    The amount of effort you put into this is evident.

  • @houtsothy2986
    @houtsothy29866 ай бұрын

  • @amdenis
    @amdenis6 ай бұрын

    No H100/H200 or even A100/V100 means only a small % of compute intensive AI devs would opt for it. The Intel is NOT more cost effective, unless you can’t get H100’s.

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    6 ай бұрын

    NVIDIA was showing in its slides for the latest MLPerf Training v3.1 that Intel Gaudi2 is more cost effective than H100. That is even just accelerator costs not including the fact that NVIDIA was using PCIe switches and Infiniband not Ethernet straight from Gaudi2. A100 is falling behind now since it does not support FP8 Transformer Engine

  • @Heffen89
    @Heffen897 ай бұрын

    Intesting sight that Intel uses 5p CEE connectors (the red ones) for the power supply of the racks. I would have expected some weird NEMA connector due to the location in the US. Do they really have 400V AC 3 phase power there?

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    I am not 100% sure, but I would not be surprised in the least given the cloud provider reliability lab is in the facility as well.

  • @eDoc2020

    @eDoc2020

    7 ай бұрын

    I doubt it's 400v, that's not a standard voltage here. It's much more likely to be 480/277 which is a very common industrial voltage here. It could even be 600/347 which is often used to feed large buildings or campuses which normally is stepped down to 120/208 closer to the point of use.

  • @jonathanbuzzard1376

    @jonathanbuzzard1376

    7 ай бұрын

    @@eDoc2020 It is highly likely to be 400V three phase. If you tap off from each phase you get 230V single phase. It makes delivering lots of power to a rack much easier and just about everything can handle 230V these days even if you are in North America.

  • @eDoc2020

    @eDoc2020

    7 ай бұрын

    @@jonathanbuzzard1376 I know _why_ 400v is often used but that doesn't change the fact that 480 is much more common here. Standard server PSUs usually are good for 100-240 volts nominal and there are also versions available for 277 volts (one leg of 480v). But you need to keep in mind this isn't a normal server, these AI servers are using over 10 kilowatts _each._ Rather than using the 230 or 277 volts of one leg they probably use the higher voltage directly. The PSUs might even have true three phase input.

  • @jonathanbuzzard1376

    @jonathanbuzzard1376

    7 ай бұрын

    ​@@eDoc2020 Blah, blah blah. Let me just check oh yes we have some of those 10kW AI servers in the racks at work. So I have *ACTUAL* experience of getting these things powered and cooled. The cooling is *NOT* an issue, the 12 year old water cooled rear doors can handle a rack full of them and still have headroom. Getting electrical power in is a different question and 3 phase PDU are the way it is done. Oh and *ALL* the vendor solutions for these servers have single phase power supplies. So stop trying to teach grandma to suck eggs. At the moment we can only get one of them per rack as we need serious power upgrades first which are been worked on, but hey what do I know about the subject. Note we have racks full of A100 GPU servers but the H100's have caught us off guard on how to actually get the power into the racks and into the servers. It is a complete sh*t show frankly.

  • @EyesOfByes
    @EyesOfByes7 ай бұрын

    Well, I guess the algorithm forces everyone to go all MrBeast-thumbnail. Cant blame you though. That's just how the system works now ;)

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Yea :-/

  • @MakayMurray
    @MakayMurray7 ай бұрын

    How many times are you going to change your thumbnail? I count 5 times at least

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    The video has great watch time metrics, but CTR is not great, so trying different things. We are slightly too small to have access to the automatic KZread A/B thumbnail testing feature so we have to do it manually.

  • @charleshines2142
    @charleshines21427 ай бұрын

    I can imagine a name like Serve The Home University. Imagine all of the puns that would come about hehehe

  • @blademan7671
    @blademan76717 ай бұрын

    Never seen big ass noise suppression cans in a data center. Always seen ear plugs, usually in a dispenser next to the door.

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Yea, they had both, but I missed the plugs.

  • @jonathanbuzzard1376

    @jonathanbuzzard1376

    7 ай бұрын

    Usually the ear plugs are for visitors, the regular users have ear defenders.

  • @lucandehaan4734
    @lucandehaan47347 ай бұрын

    WEF ?

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    hmm?

  • @Seris_
    @Seris_7 ай бұрын

    thats some questionable cable management for sure

  • @michaelknight2342

    @michaelknight2342

    7 ай бұрын

    Is it the tight cable turns?

  • @Seris_

    @Seris_

    7 ай бұрын

    @@michaelknight2342 Its the random cables velcro'd down the front of the cabinets that seem to go in random directions lmao

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    A lot of that is actually racks that are still in process.

  • @shiddygaymer
    @shiddygaymer7 ай бұрын

    "AI"

  • @user-ss7mi4dk2x
    @user-ss7mi4dk2x5 ай бұрын

    😅😘😘😘😅😅😅😅

  • @michaelmcconnell7302
    @michaelmcconnell73027 ай бұрын

    im not trying to be a jerk, but im gonna sound like a jerk. you could use some exercise my guy.

  • @ServeTheHomeVideo

    @ServeTheHomeVideo

    7 ай бұрын

    Totally. Rough doing 32 flights in 60 days.

  • @michaelmcconnell7302

    @michaelmcconnell7302

    7 ай бұрын

    @@ServeTheHomeVideo busy is good ❤

  • @AchikFollani
    @AchikFollani6 ай бұрын

    بسم الله الرحمن الرحيم اللهم صلي على محمد وال محمد وعلما ادم الاسماء كلها ثم عرضها غلى الملاءيكة فقال انبوني باسماء هولاء ان كنتم صدقين قالوا سبحانك لا علم لنا الاما علمتنا انك انت العليم الحكيم .صدق الله العظيم

  • @MaxCarponera
    @MaxCarponera7 ай бұрын

    Intel still exist?

  • @NeptuneSega

    @NeptuneSega

    7 ай бұрын

    You live under a rock?

  • @MaxCarponera

    @MaxCarponera

    7 ай бұрын

    @@NeptuneSega No I live in the present. Their 64 bits technology failed, their CPUs are still 7nm technology when everybody is 5nm and 4mm, their last domestic 14xxx family "refresh" is ridculous, lost all the market at consoles and Macs, their discrete GPUs releasing has been laughable, and their server family has been expensive and obsolete by ages in process of losing the insdustry market.

  • @snoot6629

    @snoot6629

    7 ай бұрын

    ​@MaxCarponera oh so you do know about them Why you ask if they still exist if you know they still do?

Келесі