Sizing of AI Clusters: What to Consider – Presented by World Wide Technology

Spring 2025

One of the biggest challenges in deploying AI solutions is determining the right cluster size to train and run Generative AI (GenAI) models efficiently. How many GPUs are needed? Which factors drive compute, memory, and storage requirements? And how do you balance performance, cost, and scalability when designing AI infrastructure?

In this session, experts from World Wide Technology will provide practical guidance and an estimation model to help IT leaders and AI architects make informed decisions about AI cluster sizing.

Key topics will include:

  • Defining Your AI Use Case
  • Compute & Memory Considerations
  • Storage & Networking Needs
  • Scalability & Future-Proofing
  • Cost vs. Performance Trade-Offs
  • Introduction to an AI Sizing Estimation Model

Whether you’re building new AI infrastructure or optimizing an existing deployment, this session will give you the tools and methodologies needed to size and scale AI clusters effectively.

Speakers:

Derrick brings 25+ years of experience as a technologist and leader in AI, Data Center and Cloud architectures, Enterprise Networking, Mobility, and Collaboration. He is a Principal Solutions Architect within Global Solutions and Architecture (GS&A) at WWT, responsible for High Performance Architecture (HPA), AI, and Data Analytics. Most recently, Derrick developed new Data Strategy, AI, and HPA workshops and worked with WWT’s ATC innovation team to build the company’s new AI lab, known as the AI Proving Ground.
