-
The AI infrastructure provider’s first GPU cluster in the US will handle customer workloads from early 2025
-
The Kansas City GPU cluster has a potential capacity to host up to approximately 35 thousand GPUs after expansion
-
Nebius’ new customer hubs in San Francisco and Dallas have been operational since September; a third office in New York is coming later this year
AMSTERDAM–(BUSINESS WIRE)–Nebius Group NV (“Nebius Group”, “Nebius” or the “Company”; NASDAQ:NBIS), a leading AI infrastructure company, today announced the launch of its first GPU cluster in the United States United with a deployment in Kansas City, MO, bringing its AI-native cloud closer to American customers.
The Kansas City cluster, scheduled to go live in Q1 2019, will host thousands of cutting-edge NVIDIA GPUs, primarily early-stage H2025 Tensor Core GPUs, with the energy-efficient NVIDIA Blackwell platform expected to arrive in 2000. Colocation can be expanded from an initial 2025 MW up to 5 MW, or approximately 40 thousand GPUs, at full potential capacity.
Nebius is actively growing its presence in the US as part of its strategy to become a leading provider of AI infrastructure to AI builders globally, and is in advanced discussions for a second large-scale GPU cluster in the US, including It’s scheduled to go live in 2025. The company also opened two new customer-facing hubs in San Francisco and Dallas, with a third office opening in New York later this year.
Arkady Volozh, founder and CEO of Nebius, said:
“Our first U.S. GPU cluster and new offices represent a critical step in our expansion into the U.S. market. Serving American customers from American facilities means lower latency and maximizes the benefits of our AI-native cloud. We will develop more GPU clusters in United States to meet the growing demand for high-quality AI infrastructure from US AI developers and companies.riseS.”
Built on the latest NVIDIA GPUs with a fleet of H100s already installed and H200s arriving this month, Nebius’ full-stack AI infrastructure is purpose-built to meet the needs of the global AI industry and is based on deep technical expertise in the field. hardware and software, cloud engineering, and machine learning (“ML”).
Announced publicly in October, the Cloud Nebius native AI it is designed to manage the entire ML lifecycle, from data processing and training to tuning and inference, all in one place. The recently launched Artificial intelligence study Nebius The inference service expands the company’s offering to app developers, giving them access to a range of cutting-edge open source models in a flexible and intuitive environment, at one of the lowest per-token prices on the market.
Nebius has a team of approximately 400 engineers with decades of experience building world-class technology infrastructure, as well as an internal large language model (“LLM”) research and development team. Listed on the Nasdaq, the Company recently announced investments exceeding $1 billion in AI infrastructure by mid-2025, enabling Nebius to deploy tens of thousands of NVIDIA GPUs to bring its highly differentiated, power-efficient, AI-native cloud offering to customers around the world.
About Nebius
Nebius is a technology company building full-stack infrastructure to serve the explosive growth of the global AI industry, including large-scale GPU clusters, cloud platforms, and developer tools and services. Headquartered in Amsterdam and listed on Nasdaq, the company has a global presence with research and development centers in Europe, North America and Israel.
Nebius’ core business is an AI-centric cloud platform built for intensive AI workloads. With proprietary cloud software architecture and internally designed hardware (including server, rack and data center design), Nebius provides AI builders with the compute, storage, managed services and tools they need to build, optimize and run their models.
A preferred cloud provider in the NVIDIA Partner Network, Nebius offers high-end infrastructure optimized for AI training and inference. The company boasts a team of approximately 400 skilled engineers, offering a true cloud experience at scale, tailored for AI developers.
To find out more, visit www.nebius.com
Denial of responsibility
Forward-Looking Statements
This press release and the materials referenced herein contain forward-looking statements that involve risks and uncertainties. All statements contained or implied other than statements of historical fact, including, without limitation, statements regarding Nebius’ planned GPU cluster expansion, business plans, market opportunities, capital expenditure requirements, financing and expected financial performance are forward-looking statements. In some cases, these forward-looking statements can be identified by words or phrases such as “may”, “will”, “expect”, “anticipate”, “target”, “estimate”, “intend”, “plan”, “believe” , “potential”, “continue”, “is/are likely to” or other similar expressions. Furthermore, these forward-looking statements reflect Nebius’ current views regarding future events and are not a guarantee of future performance. Actual results may differ materially from the results anticipated or implied in such statements and results reported by Nebius should not be relied upon as an indication of future performance. Potential risks and uncertainties that could cause actual results to differ materially from the results anticipated or implied by such statements include, among others, Nebius’ ability to successfully manage and develop a fundamentally different, early-stage group following the divestiture of a significant portion of our historical operations; to implement Nebius’ business plans; to conclude rental agreements or real estate acquisitions on acceptable terms, to continue to successfully acquire customers; to continue to successfully obtain required supplies of hardware on acceptable terms; and to obtain any additional debt or equity financing that may be necessary to achieve Nebius’ objectives. Many of these risks and uncertainties depend on the actions of third parties and are largely beyond the control of Nebius. Despite the completion of the full divestment of the Company’s Russian operations, Nebius continues to be subject to many of the risks and uncertainties included under the headings “Risk Factors” and “Operational and Financial Review and Prospects” in Nebius’ Annual Report on Form 20- F for the year ended December 31, 2023 and “Risk Factors” in a shareholder circular filed as Exhibit 99.2 to a Report on Form 6-K filed with the U.S. Securities and Exchange Commission (“SEC”) l ‘February 8, 2024, available on Nebius’ investor relations website at and on the SEC website at www.sec.govAll information in this release is current as of November 19, 2024, and the Company undertakes no obligation to update such information, except as required by law.
In addition, statements that “we believe” and similar statements reflect the Company’s beliefs and opinions on the subject matter at issue. Such statements are based on information available to Nebius as of the date of this press release and, although Nebius believes that such information constitutes a reasonable basis for such statements, such information may be limited or incomplete and such statements should not be interpreted as an indication that Nebius has conducted an exhaustive investigation or review of all potentially available relevant information. Such statements are inherently uncertain and investors are cautioned not to place undue reliance on such statements.
Contact us
For journalists: [email protected]
For investors: [email protected]
What specific strategies does Nebius implement to enhance its technology infrastructure in response to the rapid growth of AI applications, and how do you prioritize these initiatives in your development roadmap?
Ed questions to elicit detailed responses from the interviewee:
1. As a company focusing on building technology infrastructure for the AI industry, what are some of the challenges you face in meeting the increasing demand for GPUs?
2. How does Nebius differentiate itself from other players in the market in terms of its cloud platform and infrastructure offerings?
3. Can you provide an overview of the development process and research behind your internally-designed hardware, such as servers and racks?
4. What role does Nebius’ location in Amsterdam play in its ability to serve the global AI market, and what are the advantages for customers?
5. How do you see the integration of AI into various industries and applications evolving in the coming years, and what are some potential use cases that Nebius is currently exploring?
6. As an NVIDIA Partner Network preferred provider, how does Nebius balance working with a single vendor while also ensuring compatibility with other platforms?
7. With a total investment of over $1 billion in AI infrastructure by 2025, what measures does Nebius take to ensure its investments align with market demands and customer needs?
8. How does Nebius plan to address the complexities of managing such large-scale GPU clusters, particularly in terms of power efficiency and cooling?
9. Can you provide some insights into Nebius’ expansion plans beyond Europe and North America? Are there any specific regions or markets you are targeting?
10. As a publicly-traded company, how does Nebius navigate the expectations of shareholders while also staying true to its long-term vision for the business?