Sead Fadilpasic
2024-09-27 12:55:28
siliconangle.com
Artificial intelligence is undergoing a major transformation as AI cluster technology advances, revolutionizing how industries integrate and implement this powerful tool.
This emerging technology is not only speeding up AI processes but also making them more efficient and cost-effective, driving broader adoption and accessibility across various sectors, said David Kanter (pictured, middle), founder and head of MLPerf at MLCommons Association.
“One of the things that I’m excited about that the team has done recently is first of all, we’ve added a lot of gen AI benchmarks,” Kanter said. “Then we also added power measurement so that you can see whether it’s data center inference or training of these large-scale models, how much power and energy are you using? And we’ve seen in the five years that we’ve been around, we were able to get something like 50 times better performance, which is way faster than what we would expect.”
Kanter and Adi Gangidi (right), RDMA systems for AI training at Meta Platforms Inc., spoke with the host Rakesh Kumar (left), senior engineering leader at Juniper Networks Inc. at the Seize the AI Moment event, during an exclusive broadcast on theCUBE, SiliconANGLE Media’s livestreaming studio. They discussed how AI cluster technology and collaborative benchmarking efforts are driving the next phase of AI adoption by enhancing infrastructure efficiency, scalability and performance. (* Disclosure below.)
Best practices in AI cluster technology for scalable performance
One of the key drivers behind this exponential growth in AI performance is collaboration between organizations, academia and engineers. MLCommons, a nonprofit industry consortium, exemplifies this approach by bringing together diverse players to create standardized benchmarks that measure AI performance, Kanter added.
“So MLCommons really got started with the MLPerf benchmarks as part of what’s bringing us all together today,” he said. “And it was sort of founded in the early days of machine learning and we didn’t have good standard ways of measuring performance. And so, we got the whole community together. We built some standard measures for AI training, which became MLPerf training.”
This collaborative spirit is echoed by Meta, a founding member of MLCommons. Meta’s commitment to benchmarking, particularly in the Chakra work group focused on improving communications performance, demonstrates the integral role of network engineering in optimizing AI performance, Gangidi concluded.
“I think benchmarking and the work that David and MLCommons team is doing is important because with benchmarks you can understand how ML models stress infrastructure or what tasks they’re able to do,” he said. “You’re able to reproduce them and repeat them. If you cannot reproduce something, then it’s hard to improve it or to make it more reliable.Bbenchmarking is a very fundamental aspect of what helps scale these clusters.”
Here’s the complete video interview, part of SiliconANGLE’s and theCUBE Research’s coverage of the Seize the AI Moment event:
Here’s the complete event video playlist:
https://www.youtube.com/watch?v=videoseries
(* Disclosure: Juniper Networks Inc. sponsored this segment of theCUBE. Neither Juniper Networks nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)
Photo: SiliconANGLE
Your vote of support is important to us and it helps us keep the content FREE.
One click below supports our mission to provide free, deep, and relevant content.
Join our community on YouTube
Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.
THANK YOU
Support Techcratic
If you find value in Techcratic’s insights and articles, consider supporting us with Bitcoin. Your support helps me, as a solo operator, continue delivering high-quality content while managing all the technical aspects, from server maintenance to blog writing, future updates, and improvements. Support Innovation! Thank you.
Bitcoin Address:
bc1qlszw7elx2qahjwvaryh0tkgg8y68enw30gpvge
Please verify this address before sending funds.
Bitcoin QR Code
Simply scan the QR code below to support Techcratic.
Please read the Privacy and Security Disclaimer on how Techcratic handles your support.
Disclaimer: As an Amazon Associate, Techcratic may earn from qualifying purchases.