Scaling Cloud Network Infrastructure for the AI Era

Oct 16, 2024
The world has changed dramatically since generative AI made its debut. Businesses are starting to use it to summarize online reviews. Consumers are getting problems resolved through chatbots. Employees are accomplishing their jobs faster with AI assistants. What these AI applications have in common is that they rely on generative AI models that have been trained on high-performance back-end networks in the data center and served through AI inference clusters deployed in data center front-end networks.

Training models can use billions or even trillions of parameters to process massive data sets across artificial intelligence/machine learning (AI/ML) clusters of graphics processing unit (GPU)-based servers. Any delays, such as from network congestion or packet loss, can dramatically affect the accuracy and training time of these AI models. As AI/ML clusters grow ever larger, the platforms used to build them need to support higher port speeds as well as higher radices (that is, more ports per device). A higher radix enables flatter topologies, which reduces the number of layers and improves performance.

Meeting the demands of high-performance AI clusters

In recent years, we have seen the GPU needs for scale-out bandwidth increase from 200G to 400G to 800G, which is accelerating connectivity requirements compared to traditional CPU-based compute solutions. The density of the data center leaf must increase accordingly, while also maximizing the number of addressable nodes with flatter topologies.

To address these needs, we are introducing the Cisco 8122-64EH/EHF with support for 64 ports of 800G. This new platform is powered by the Cisco Silicon One G200, a 5 nm 51.2T processor that uses 512 x 112G SerDes, which enables extreme scaling capabilities in just a two-rack unit (2RU) form factor (see Figure 1). With 64 QSFP-DD800 or OSFP interfaces, the Cisco 8122 supports options for...
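
As a rough illustration of the earlier point that a higher radix allows flatter topologies, the sketch below estimates how many endpoints a non-blocking folded-Clos (fat-tree) fabric can address for a given switch radix and number of tiers. It uses the textbook capacity formula 2 * (radix / 2) ^ tiers, which assumes identical switches and no oversubscription. The function names are hypothetical, and treating a 64 x 800G device as radix 128 when its ports are split into 400G links is an assumption made for the example, not a configuration taken from the article.

```python
def max_endpoints(radix: int, tiers: int) -> int:
    """Endpoints addressable by a non-blocking folded-Clos fabric.

    Assumes every switch has the same even port count (radix) and that
    each non-top tier splits its ports evenly between downlinks and uplinks.
    """
    if radix < 2 or radix % 2 != 0:
        raise ValueError("radix must be an even number >= 2")
    if tiers < 1:
        raise ValueError("tiers must be >= 1")
    return 2 * (radix // 2) ** tiers


def tiers_needed(radix: int, endpoints: int) -> int:
    """Smallest number of tiers that can address the requested endpoints."""
    tiers = 1
    while max_endpoints(radix, tiers) < endpoints:
        tiers += 1
    return tiers


if __name__ == "__main__":
    target = 8192  # hypothetical cluster size, in endpoint links
    for radix in (32, 64, 128):
        print(radix, tiers_needed(radix, target), max_endpoints(radix, 2))
```

Under these assumptions, a radix-32 switch needs three tiers to reach 8,192 endpoints, while a radix-128 switch reaches the same count in two, which is the sense in which a higher radix removes a layer from the fabric.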
