K8s Horizontal Pod Autoscaler | HPA Manifest File Explained | Pod Requests Limits | HPA Demo
Ғылым және технология
In this detailed kubernetes tutorial, we will look at EC2 Scaling Vs Kubernetes Scaling. Then we will dive deep into pod request and limits, Horizontal Pod Autoscaling (HPA) and how to code manifest file for it. Finally we will demo HPA in action!
Kubernetes Reference: kubernetes.io/docs/tasks/run-...
💰👨🎓 Get my courses with max discount and 30 day moneyback guarantee: bit.ly/3Eku9RH
*Connect with me*
🤳 Instagram: / cloudwithraj
🏢 LinkedIN: / rajdeep-sa-at-aws
🐦Twitter: / cloudwithraj
🖼 Facebook: / cloudwithraj
Пікірлер: 87
crisp and clear. Saved a lot of my time. Thank you Raj. Simply Brilliant
Awesome Video, really cleared my concept of HPA, and it is working with Cluster Autoscaler. Thank you Very much for the video.
Very clear and well illustrated !.
Great explanation. I was able to follow. You saved a lot of time. Thanks
Great video bro. Thank you very much.
Very well explained the concepts of autoscaling at different levels
Thanks Raj! You made my day! Great demo!
@cloudwithraj
2 жыл бұрын
Glad to hear that!
Dude, you're a real life super hero. keep it up !
@cloudwithraj
3 жыл бұрын
Haha thanks mediatwinkle, glad the video was helpful.
Beautifully explained. All my queries are cleared. Thank you.
@cloudwithraj
2 жыл бұрын
Glad it was helpful Dayananda
Excellent explanation and very apt presentation with the diagrams. Great work Sir...!
@cloudwithraj
4 жыл бұрын
Thanks Thrihesh for the kind words. I am glad you found it useful, thanks for watching.
Loved it.
nicely explained.. Thank you so Much!!!
As always awesome video Raj , Thank you so much ♥
@cloudwithraj
2 жыл бұрын
Glad you liked it
Excellent Raj ....now my all doubts related to HPA is cleared. Thank You so much
@cloudwithraj
2 жыл бұрын
You are most welcome
u r amazing raj
absolutely great video sir
@cloudwithraj
2 жыл бұрын
Thanks bro, glad you liked it
nice one.. simple and neat
Very good content.
Thanks for detailed explanation!
@cloudwithraj
2 жыл бұрын
Glad it was helpful!
Great presentation! Thanks so much.
@cloudwithraj
2 жыл бұрын
Glad you liked it!
Raj went beast mode with this tutorial. Very nice work!
@cloudwithraj
2 жыл бұрын
Appreciate it!
@byteslinger3187
2 жыл бұрын
@@cloudwithraj Hey Raj would you be willing to do a consultation? For payment of course. My company is stuck on the last piece to launch our server hosting business. Were having issues using the kubernetes JavaScript client to stream pod logs to our react front end. Its been websocket hell. Any help or advice would be awesome and again we can pay.
@cloudwithraj
2 жыл бұрын
@@byteslinger3187 Ty for the offer! Not doing any consultation currently
The explanation is really good, thank you
@cloudwithraj
2 жыл бұрын
Glad it was helpful!
Great presentation!
@cloudwithraj
2 жыл бұрын
Thanks for watching
I request you Raj to pls guide me step by step creation on jenkins pipeline ....demo to onboard docker service using jenkins pipeline. How we can mention bitbucket repos inside jenkins pipe and build the same.
Excellent explaining
@cloudwithraj
Жыл бұрын
Glad it was helpful!
Great video, really helped me get my head around the HPA concept thanks!
@cloudwithraj
3 жыл бұрын
Very welcome!
@hassanharrison3986
2 жыл бұрын
i guess I am kind of off topic but do anyone know of a good website to watch newly released tv shows online?
@raylanamir3359
2 жыл бұрын
@Hassan Harrison Flixportal :)
@hassanharrison3986
2 жыл бұрын
@Raylan Amir Thanks, I went there and it seems to work :D Appreciate it !
@raylanamir3359
2 жыл бұрын
@Hassan Harrison happy to help xD
Great demonstration. Question, how does the pod updates rolls out in HPA? For instance, I have HPA defined with minReplica's as 2 and max as 10, pods got scaled correctly when deployment triggered. However when I roll out update, it is supposed to bounce both the pods with new config, however I am seeing only 1 pod get bounced and I need to bounce the other one manually. How do I fix this?
veryy good. thanksssss
@cloudwithraj
2 жыл бұрын
Welcome 😊
Thank you so much
@cloudwithraj
Жыл бұрын
You're most welcome
life saver
thank you
Thanks!
@cloudwithraj
2 жыл бұрын
Wow, TY Ali for the donation, much appreciated.
Wowwwww
Nice Explanation 👍😊
@cloudwithraj
2 жыл бұрын
Thanks 🙂
Great video, can we use prometheus instead metrics server ??
All your kubernetes video are really great, can you please share more videos like ingress controler, node affinity
@cloudwithraj
4 жыл бұрын
Thanks for the kind words. Couple more Kubernetes videos coming soon!
it was very well explained , i have gone through many videos and docs . but getting more and more confused . specially explanation for targetCPUUtilizationPercentage was so much required . having it in demo is another level .
@cloudwithraj
Жыл бұрын
Glad it helped
In the CPU resource requests and limits 0.5 and 1.0 are NOT percentage numbers, they are ABSOLUTE numbers, and express cores. 0.5 means half a core. You need to correct this.
What if I want to have more than 512 mib for a pod? Do we then need to increase the millicores? Also can we take any amount of millicores like 567 or 666?
Executing kubectl get hpa, is returned 45%/50% with 3 replicas, for instance. This 45%, is the sum of all pods or among these 3 pods, is the pod with highest percentage? Nice explanation, I just would like to understand better this metric.
Hi thank you for this video, I just need to ask on 9:47 is it also possible to monitor the targetMemoryUtilizationPercentage in HPA?
@cloudwithraj
4 жыл бұрын
Yeah, it's possible. HPA can monitor lot of advanced metrics beyond the basic ones.
spec: maxReplicas: 8 minReplicas: 2 scaleTargetRef: apiVersion: apps/v1 kind: Deployment name: nginx targetCPUUtilizationPercentage: 60
on resources section, we are providing request and limit for cpu. A pod will be created with request cpu then how come it grows to use limit cpu ? Kindly clarify how request grows to complete the limit cpu
Eks namespace access level via aws sso video also we are looking...
But what is the role of cpu limit in hpa scalling up and down, do we really need it or we should remove it
I have set up my test program very similar to yours. I used apache2 ab command line to increase the load. I was able to see the number of pods increases as the cpu usages goes high. However, there was no time difference in Time per request or Total time taken when I manually increased number of replicas(let's say from replicas=2 to 4) . I played around number of request and number of concurrent requests options from ab command line. I was able to see an improvement in time per request with very particular set of request and concurrent request. I checked CPU usage with htop and it used 100% of CPU in very particular situation. So I had to make conclusion that increasing replicas doesn't always guarantee better performance for user's experience. I want to hear your opinion on this. Thank you
@aniceguy241
6 ай бұрын
because the purpose is to achieve higher availability, not better performance in speed, HA != faster
🆒👍
Can deployment has multiple replica s ? I mean deployment must have more than one replica to ensure ...!
where is the reference for the app, like selector shud be there in the HPA config, explanation is indeed good, but i think i missed it
I have an error : wget: can't connect to remote host:Connection refused. How can i fix it. My service type is ClusterIP
Small correction: mebibyte uses 1024 as base, while megabyte uses 1000 as base.
How to change default values in hpa
Can you scale based on memory?
@cloudwithraj
Жыл бұрын
Yeah!
why this dashboard get the metrics so long?
omg content.. can we have your Udemy course link? would try to enroll it
@cloudwithraj
2 жыл бұрын
bit.ly/3Eku9RH , max discount going on right now, expiring Monday
Kube will throttle the pod if cpu usage exceeds limit. Not kill it. There are other circumstances where kube will terminate pods.
kubernetes-sigs/metrics-server/releases/download/v0.3.7/components.yaml": no matches for kind "APIService" in version "apiregistration.k8s.io/v1beta1" ensure CRDs are installed first