Title
AWS re:Invent 2022 - How Yahoo cost optimizes their in-memory workloads with AWS (DAT321)
Summary
- Yahoo moved their ad platform to AWS and optimized costs using Amazon ElastiCache.
 - Maulik Shah from Yahoo and Itai Meos from AWS presented the session.
 - Yahoo's ad platform generates 320 billion events per day, requiring efficient data processing.
 - Yahoo initially considered DynamoDB and HBase but settled on ElastiCache due to its performance and cost-effectiveness.
 - Data tiering in ElastiCache allowed Yahoo to reduce cluster size and costs by 50%.
 - Yahoo's architecture includes AWS Kinesis, S3, EMR, and Glue, with data querying via Athena or direct S3 access.
 - Yahoo learned to manage connections efficiently and implement exponential backoff with retries.
 - Yahoo plans to further optimize costs and migrate on-prem workloads to AWS.
 - AWS introduced data tiering, allowing storage expansion by moving data from memory to SSDs, saving up to 60% in costs.
 - Data tiering is transparent to applications and requires no code changes.
 - AWS also discussed MemoryDB for Redis, a durable in-memory database service.
 - New ElastiCache features include an improved management console, Memcached 1.6.12, encryption in transit, IPv6 support, Redis log delivery, native JSON support, AWS PrivateLink, Redis 7, and IAM authentication.
 - ElastiCache is available in all AWS regions and will be included in new regions.
 
Insights
- Yahoo's use case demonstrates the scalability and cost-effectiveness of AWS services for handling large-scale data workloads.
 - Data tiering is a significant innovation that allows for cost savings by intelligently managing data storage between memory and SSDs.
 - The session highlighted the importance of monitoring and managing connections to maintain performance and reliability.
 - AWS's commitment to customer feedback is evident in the development of data tiering and other new features.
 - MemoryDB for Redis offers durability and high performance, expanding the options for customers needing in-memory databases.
 - The introduction of new features and services like MemoryDB for Redis and data tiering indicates AWS's focus on continuous innovation and addressing diverse customer needs.
 - The global availability of ElastiCache ensures that customers can rely on consistent performance and features regardless of their geographic location.