Pinecone AWS Reference Architecture Deployment Part 3

Nov 30, 2023

124 views

The Pinecone AWS Reference Architecture (github.com/pinecone-io/aws-re...) is the fastest way to go to production with high-scale use cases that leverage Pinecone's vector database.
This video is part 3 of a 3-part series where we deploy the Reference Architecture end to end.
You can find the playlist dedicated to the Pinecone AWS Reference Architecture here: • Pinecone AWS Reference...
In this video, we continue to talk through the deployment as more resources come up within AWS.
Timestamps are linked below for easier navigation:
0:15 ECS target and autoscaling policies provisioning
0:31 Emu workers are up and healthy. Discussing autoscaling policy and behavior
1:05 Autoscaling configuration properties as "knobs"
1:55 Different autoscaling strategies and policies you can configure
3:30 Configuring the autoscaling parameters in code with Pulumi
4:00 AWS quotas and their importance to high-scale use cases
5:00 scaleInCooldown and scaleOutCooldown and what they do
5:40 Autoscaling behavior for Pelican and why it is slightly different from Emu
6:15 Deployment is complete at 14.5 minutes!
6:37 Stepping through the AWS account to see what's running
7:30 Looking at the RDS database
7:52 Sanity-checking the Frontend ECS service and the table-driven semantic search UI
8:14 Load balancer integration for UI and how the UI looks
8:30 The deployment is healthy!
8:43 SQS queues
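
For readers who want a feel for the scaleInCooldown and scaleOutCooldown "knobs" discussed at 5:00, here is a minimal sketch in plain TypeScript. It is a simplified model of how cooldowns gate scaling decisions, not the Pulumi code from the Reference Architecture and not the AWS API. The names `ScalingConfig`, `ScalingState`, and `nextState`, and all numeric values, are hypothetical and chosen only for illustration.

```typescript
// Simplified, hypothetical model of cooldown-gated autoscaling.
// Real AWS target tracking has more nuance; this only shows the gating idea.
interface ScalingConfig {
  minCapacity: number;      // never run fewer tasks than this
  maxCapacity: number;      // never run more tasks than this
  targetValue: number;      // desired metric value per task (e.g. messages per worker)
  scaleInCooldown: number;  // seconds to wait after a scale-in before the next scale-in
  scaleOutCooldown: number; // seconds to wait after a scale-out before the next scale-out
}

interface ScalingState {
  taskCount: number;
  lastScaleOutAt: number; // time of the last scale-out, in seconds
  lastScaleInAt: number;  // time of the last scale-in, in seconds
}

function nextState(
  cfg: ScalingConfig,
  state: ScalingState,
  metricTotal: number,
  now: number
): ScalingState {
  // Tasks needed to bring the per-task metric down to the target, clamped to the limits.
  const raw = Math.ceil(metricTotal / cfg.targetValue);
  const desired = Math.min(cfg.maxCapacity, Math.max(cfg.minCapacity, raw));

  // Scale out only if the scale-out cooldown has elapsed.
  if (desired > state.taskCount && now - state.lastScaleOutAt >= cfg.scaleOutCooldown) {
    return { ...state, taskCount: desired, lastScaleOutAt: now };
  }
  // Scale in only if the scale-in cooldown has elapsed.
  if (desired < state.taskCount && now - state.lastScaleInAt >= cfg.scaleInCooldown) {
    return { ...state, taskCount: desired, lastScaleInAt: now };
  }
  // Otherwise hold steady.
  return state;
}

// Illustrative values only: scale out quickly, scale in slowly.
const cfg: ScalingConfig = {
  minCapacity: 1,
  maxCapacity: 10,
  targetValue: 100,
  scaleInCooldown: 300,
  scaleOutCooldown: 60,
};

let state: ScalingState = { taskCount: 2, lastScaleOutAt: -Infinity, lastScaleInAt: -Infinity };
state = nextState(cfg, state, 450, 0);  // backlog grows: scales out
state = nextState(cfg, state, 900, 30); // still inside the scale-out cooldown: holds
state = nextState(cfg, state, 900, 60); // cooldown elapsed: scales out again
console.log(state.taskCount);
```

A short scale-out cooldown paired with a longer scale-in cooldown, as in these made-up values, is one common way to react quickly to load while avoiding rapid removal of capacity.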