- 76
- 296 062
Geert Baeke
Belgium
Приєднався 18 тра 2013
I post tech videos on all things cloud and cloud-native. Focus on Azure and Kubernetes!
The videos are usually recorded in one take but edited somewhat for fluency. I leave in the typos or other small issues. We all make mistakes once in a while and they are also a learning opportunity! ☺
Let's connect on Twitter @geertbaeke or LinkedIn www.linkedin.com/in/geertbaeke/
#azure #kubernetes #aks #cloud #cloudnative
The videos are usually recorded in one take but edited somewhat for fluency. I leave in the typos or other small issues. We all make mistakes once in a while and they are also a learning opportunity! ☺
Let's connect on Twitter @geertbaeke or LinkedIn www.linkedin.com/in/geertbaeke/
#azure #kubernetes #aks #cloud #cloudnative
Enhancing RAG with Knowledge Graphs and OpenAI GPT-4o Mini
This video discusses standard (vector-based) RAG, how it works and what its potential drawbacks are. It then introduces knowledge graphs as a possible solution and provides details about Microsoft's open source Graph RAG implementation.
Переглядів: 427
Відео
The fast path to creating and using LLM flows in your apps
Переглядів 3668 місяців тому
Use Microsoft Prompt Flow to design, test and run flows use in your LLM apps. Git: github.com/gbaeke/promptflow-session #azure #ai #promptflow
Should you use the Azure OpenAI Assistants API?
Переглядів 3,4 тис.9 місяців тому
There"s a new Azure OpenAI API in town: the assistants API. Should you ditch the existing API that's used in nearly all LLM apps when this new API is out? Watch to find out... Demo starts here - ua-cam.com/video/PAyDGtKf9ic/v-deo.htmlsi=gDZD5cLtiKkANgyc&t=1063 Repo: github.com/gbaeke/azure-assistants-api #azure #openai #assistant
Azure AI Search with a custom GPT-4 Vision skill
Переглядів 1,6 тис.Рік тому
In this video, we will use Azure AI Search to convert a bunch of images to a searchable index that contains the image description in text and as a vector. With such an index, you can search for images using natural language. The image description is created with GPT-4 Vision. In addition, we also store an image vector so you can search based on similar images. Code is here: github.com/gbaeke/vi...
Create a home automation assistant with OpenAI
Переглядів 655Рік тому
In this video, I use the OpenAI Assistant API to control my Hue lights using natural language and speech. Code: github.com/gbaeke/openai_assistant
Kubernetes Meetup: Container Apps
Переглядів 435Рік тому
This is a re-recording of the session I gave at the Belgian Kubernetes meetup on March 28th, 2023. Some things to note: - I mentioned Google Cloud Functions when I meant Google Cloud Run - CPU & Memory limits will likely change soon - Key Vault integration should come end of March 2023
Pinecone and OpenAI: Query your blog posts with natural language
Переглядів 2,2 тис.Рік тому
This video accompanies the following post: blog.baeke.info/2023/03/16/pinecone-and-openai-magic-a-guide-to-finding-your-long-lost-blog-posts-with-vectorized-search-and-chatgpt/ In it, we store blog posts as vectors in Pinecone, a vector database. Vectors are created with the OpenAI embeddings API. When the user asks a question, we vectorize the query string and find the closest matching vector ...
OpenAI Tweet Generator on Azure Container Apps
Переглядів 314Рік тому
Although the focus is on Azure Container Apps and deployment, we also briefly look at Azure OpenAI service and how the tweets are generated from Python code. Repo: github.com/gbaeke/aca-openai
AKS Developer Extension for VS Code
Переглядів 6392 роки тому
In this video, we will take a look at the AKS Developer Extension for VS Code. The extension supports draft, a tool that automatically creates a Dockerfile and Kubernetes deployment manifests for your code. In addition, it creates a GitHub workflow that deploys your code to Kubernetes. The workflow uses OIDC connect to securely authenticate to Azure. Video about the draft tool without the exten...
Publish Your AKS Services with Azure Private Link and Front Door
Переглядів 7 тис.2 роки тому
New functionality in the Azure Cloud Provider for Kubernetes allows you to create an Azure Private Link Service directly from a service manifest. Annotations in the manifest control the name of the service, number of NAT IP addresses and more. In this video, we will use the private link service to publish the Kubernetes service with Azure Front Door Premium.
Improve your development inner loop with Draft v2 for AKS
Переглядів 4642 роки тому
Draft v2 aims to improve developer productivity to go from code to deployed container faster. Some of the things it can do: generate Dockerfiles, Kubernetes manifests, Kustomize manifests, and ingress configuration. It leverages GitHub Actions to automate the deployment process. For authentication, it sets up GitHub OIDC integration with Azure AD. Related blog post: blog.baeke.info/2022/06/02/t...
GitOps on AKS with the Flux v2 Extension
Переглядів 3,4 тис.2 роки тому
In this video, we take a look at the Microsoft.Flux extension for AKS. We will look at: - installation - preparing the git repository - creating flux configurations For commands, please see the Quick Guide at github.com/gbaeke/quick-guides/blob/main/fluxv2/README.md
Linkerd Service Mesh with Nginx Ingress Controller
Переглядів 1,9 тис.2 роки тому
The basics of Linkerd, followed by installing a meshed application that's made available via a meshed Nginx Ingress Controller. Gist with commands: gist.github.com/gbaeke/a9adc4d98a0533bf2c3260fcd42d474d
Simplify GitOps with Flux and Visual Studio Code
Переглядів 1,8 тис.2 роки тому
Weaveworks have released a GitOps extension for Visual Studio Code that makes it easy to deploy and troubleshoot Flux. Create sources and kustomizations with ease, directly from the git repository you have open in VS Code. We will also look at some of the differences between Azure Kubernetes Service (AKS) and generic clusters. Git repo: github.com/gbaeke/vscode-demo Download the extension: gith...
Run Azure App Services anywhere with Azure Arc
Переглядів 7502 роки тому
With Azure App Services for Azure Arc, you can run Azure App Services on Kubernetes anywhere. Deploy web apps, function apps and logic apps to your location of choice. Gist with all commands: gist.github.com/gbaeke/4844db10e11f20e994110e2402fb384d #azure #kubernetes #kubernetes #arc #hybrid #cloud #edge
Working with revisions in Azure Container Apps
Переглядів 1,3 тис.2 роки тому
Working with revisions in Azure Container Apps
Deploying and scaling Azure Container Apps with Bicep and Keda
Переглядів 2,2 тис.2 роки тому
Deploying and scaling Azure Container Apps with Bicep and Keda
Microservices with Azure Container Apps and Dapr
Переглядів 10 тис.2 роки тому
Microservices with Azure Container Apps and Dapr
A blue-green deployment with Argo Rollouts and Kustomize
Переглядів 6 тис.3 роки тому
A blue-green deployment with Argo Rollouts and Kustomize
Private Azure Kubernetes Service and custom DNS
Переглядів 10 тис.3 роки тому
Private Azure Kubernetes Service and custom DNS
Private Azure Kubernetes Service DNS Options
Переглядів 5 тис.3 роки тому
Private Azure Kubernetes Service DNS Options
Azure App Services with Private Link
Переглядів 5 тис.3 роки тому
Azure App Services with Private Link
GitOps on Azure Arc-enabled Kubernetes
Переглядів 8223 роки тому
GitOps on Azure Arc-enabled Kubernetes
Azure Deployments with Bicep and Azure DevOps
Переглядів 5 тис.3 роки тому
Azure Deployments with Bicep and Azure DevOps
Building a GitHub Action (with Docker)
Переглядів 1,1 тис.3 роки тому
Building a GitHub Action (with Docker)
Admission Control on AKS with Azure Policy
Переглядів 1,5 тис.3 роки тому
Admission Control on AKS with Azure Policy
Azure Kubernetes Service authentication with Azure AD
Переглядів 6 тис.3 роки тому
Azure Kubernetes Service authentication with Azure AD
Deploying Helm Charts with Azure DevOps
Переглядів 16 тис.3 роки тому
Deploying Helm Charts with Azure DevOps
Thanks for this but whenever I try to run your private endpoint code as a test it keeps giving me an error that the defaultSubnet is not provided.
thanks, that's really helpful..👍
Great Video, thanks for putting in this much effort, exactly what i was looking for.
What about summarising chat contexts and using those summarises to reduce token size? Would it be safe to assume that because he Assistants API does not allow chat history to be controlled that we cannot achieve efficiency in this manner?
By far, the best tutorial on Dapr and Azure Container Apps..awesome!!
Thank you for the kind words!
Thanks for your illustration, I really learnt a lot in it!
I have a question i want to preprocess the text data and i have certain defined rules it should take care so should i use normal chat completion or assistant for that
It's best to use the chat completions API for that. In addition, if you want to extract structured data from text, I can recommend using the completions API with the Python instructor library.
Great video. Have a question. Even if I use completions or assistant api. The tokens for ALL the conversation Will be billed?
Yes, every input and output token is billed depending on the model chosen. That's the same for both APIs.
Clearly explained ! All the other videos are filled with useless babbling and go directly in Yaml without context. Thanks.
are you available on fiverr for custom work?
no, sadly not... 🙂
clearly and simply explained A+ 🎉
Great video - very helpful! I'm intrigued to know if the CLU (conversational language understanding) fits into the assistant model, I can't piece together in my head how this could be utilized when looking at the playground UI.. I guess it's probably something you'd need to code for (fire the prompt to CLU to extract intent, then pass to assistants completion API?) appreciate your thoughts :)
Underrated by UA-cam :) Thank you
Apart from file_ids could we get data from other data source like SQL Server, BQ.... to assistants? Please could you do code video on it.
The only way to do that is via function calling and custom code that gets the data from SQL etc.. and feeds it to the assistant. There's no built-in support for that just yet.
Amazing video!
its so difficult to find the repos which you have mentioned in all of your videos.. could have been more better if the repos were maintained properly..
It’s all in my free time and I try to do my best. These videos are old already so much of the code will not be very relevant anyway. Tnx for the tip…
superb explaination. Thanks alot
I was struggling with the service account concept and now that has gone, awesome content. thanks!!!
Hi can you show how we add two domains to the same origin group and two different routes
great video
hi, what tool do you use to create the diagram?, can you share the file? thanks.
Thank you for the demo! Really appreciate it. Do you know if there's a way to integrate an APIM with this kind of setup? Would the APIM go in between the load balancer and the cluster, or in front of the load balancer?
You can put APIM in front of the load balancer but APIM does not support private link to connect to backends. You can deploy premium or the new standard v2 which allows connection to internal backends.
It's fantastic to see you back on UA-cam after a long time. Please continue sharing, and I truly appreciate your blog and videos.
Thanks for the kind words. More to come soon I hope…
What is the point of all this...
I am in no position to answer existential questions.
Very nice. Thanks for sharing ❤
clear and concise , thank you :)
you didn't talk about the helm charts?
You made this look so simple!! greatly appreciate it!! Thank you
Hi, I was thinking about the blue/green and canary with Argo rollouts and they are an effective way to test being able to quickly rollback in case of issue. But what happens if one wants to just test online but exclusively in a private way? That would be a staging environment. An option could be to just have a secondary domain and deploy the website in that secondary domain temporarily and perhaps with a modified react component or any equivalent layer only giving access to one staging user? How do big companies make that staging environment to be possible to be online but without having it exposed to public in production? In data analytics applications it's easier because usually all is in an intranet but what happens with webs that are usually B2C applications?.
Thanks for the clear explanation and the architecture drawing. Helped me to finally find the mapping issue in my code. Cheers and have a nice day😊
I am glad it was helpful!
I was wondering if an alternative to using Argo rollouts would be to use Istio. It would add more complexity to the deployment in general but Istio already has the implementation for blue/green and canary that kubernetes basic networking lacks.
Argo Rollouts works together with Istio and other service meshes to manipulate the traffic as needed. Although you could use Istio on its own and configure it all yourself, using it in combination with Argo Rollouts or Flagger is easier.
Hi@@GeertBaeke , I thought if one is using Argo rollouts or Flagger, and then one had it configure it with Istio, the blue/green and canary features of Istio are not needed any more, one is only going to use the service mesh part (and others) of Istio.
@@davidpratrThat was indeed what I was pointing at. Flagger and Rollouts using the service mesh features to orchestrate the canary or blue/green deployment. I must admit that I do not use Istio that often and tend to use Linkerd. So I am not that familiar with Istio's standalone features. With Linkerd, you have to use it with Rollouts or Flagger because it has no such standalone functions. Last time I checked 🙂
@@GeertBaeke I see Flagger is more related to Flux. Well, I think it's clear that if one starts a project from scratch and one knows one will use ArgoCD, Argo rollouts is the way to go. Another thing would be if the project was already started with it's Blue/Green or canary with Istio, then it could be more challenging to adapt the whole thing to make the rollouts with both things Argo rollouts and the Istio rollouts capability.
Great video! Thank you very much! 🎓🥇🥉🥈🏆
Great Tutorial. when trying this is SSL Cert on the server side ( and of course the right configuration as you shown on the last part of the video , aiming for Https forward route , and even disabling health checks) , it seems the the command is returning <Service Not Available> [even though its there] . would be interesting to see how this is configured with HTTPS and SSL certificates
Great video!. Thank you!
I.e mean in the secrets
Azure credentials what are the things you have given
Can you clarify somewhat more? It's been a while since I made this video...
@@GeertBaeke in the secrets in the GitHub side what we need to give what are the things I am asking . Workflows getting failed.
Thanks and appreciate 🎉🎉
You’re welcome 😊
Good one Geert. Only vid on internet that deals with home automn and openai API with fn calling. Thanks!
Thank you and glad you liked it.
Thankyou Geert, found this extemely useful 🙂
Thanks for the video. Where can I find that online tool to generate patch from base YAMLs?
Nice demo)
This is really well done!!
FYI: Canary is pronounced "can-ay-ry". I hear it a lot from non-native English speakers. Hopefully it doesn't annoy you to mention this. Other than that, thanks for taking the time to give the world a very nice overview. Nice job.
Oh no not at all. The problem will be to remember it for next time. 😀
This is great
Why we need redis in this setup what is the purpose
It’s not required for Flux. It’s just an example of installing some infra components an app might need. Could be anything really…
As I want to add the service to configure the path for downstream request using kong ingress controller. I followed this approach but in the kong documentation says it is deprecated . can you please help in writing kind: Service type of resource in helm chart
Hello! Excelent explain. I Have a question about environment. How to create a way to not depend on the number of possible IP's in the VNET. Imagine, in Kubernetes we have our own technology taking care of the IP's, I understand that each app in the ACA takes an IP that I have available in the subnets. If my set of APP's grows I can reach a point of not having more IP's available
With a VNET-integrated Container Apps environment you will have to take the number of available IP addresses into account and make the subnet large enough. That is because each replica for a container app in the consumption plan takes up an IP address. There is of course a maximum number of replicas per environment so you can take that into account for the subnet size. However, when you use workload profiles, there's one IP per host not per container app. Of course, in that case, you do not use the consumption plan and you need to provision hosts to run your apps. In larger environments, we see customers having a benefit from using workload profiles so in that case, the IP limits are not at play.
Awesome demo, thank you!
Thank you! Much appreciated!
Geert You are rock star in explaining with easy examples!!!