37 links
tagged with all of: nvidia + ai
Click any tag below to further narrow down your results
Links
Fireworks AI, a California-based startup backed by Nvidia, has reached a $4 billion valuation in discussions with Lightspeed and Index Ventures, a remarkable increase from $552 million in the past year. The company focuses on democratizing AI infrastructure, enabling enterprises to easily deploy and scale advanced generative AI models while addressing significant resource and expertise gaps in the market.
Oracle's recent fiscal report revealed a staggering increase in contracted revenue, driven by rising demand for AI computing. The company predicts its cloud infrastructure revenue will reach $114 billion by 2029, suggesting a potential for significant growth similar to that of Nvidia.
NVIDIA CEO Jensen Huang promoted the benefits of AI during his visits to Washington, D.C. and Beijing, meeting with officials to discuss AI's potential to enhance productivity and job creation. He also announced updates on NVIDIA's GPU applications and emphasized the importance of open-source AI research for global advancement and economic empowerment.
Amazon Web Services is set to unveil an updated Graviton4 chip featuring 600 gigabits per second of network bandwidth, the highest in the public cloud. This advancement positions AWS to compete more effectively against Nvidia in the AI infrastructure market, as the company aims to reduce AI training costs and enhance performance with its upcoming Trainium3 chip. AWS's focus on custom chips illustrates its strategy to dominate the AI infrastructure stack and challenge traditional semiconductor companies like Intel and AMD.
The Trump administration plans to eliminate the Biden-era "AI diffusion rule," which imposed restrictions on the export of American technology. This move is seen as beneficial for chipmakers like Nvidia, who argued that the rule would complicate international sales. Following the announcement, Nvidia's stock experienced a notable increase.
Stripe has launched a new AI foundation model specifically designed for enhancing payment processing, which aims to streamline transactions and improve efficiency. In conjunction with this, the company has announced a strengthened partnership with NVIDIA to leverage advanced AI technologies in its services.
Nvidia is set to release a new AI chipset based on its Blackwell architecture for the Chinese market, priced between $6,500 and $8,000, significantly lower than its previous H20 model. The new chip will utilize conventional memory and simpler manufacturing processes, avoiding advanced packaging technologies from TSMC. This move comes as Nvidia adjusts to U.S. export restrictions while seeking to maintain its presence in China's data center market.
The Trump administration has halted its plans to restrict exports of Nvidia's H20 artificial intelligence chips to China following a dinner with CEO Jensen Huang at Mar-a-Lago. The decision comes after Nvidia pledged new U.S. investments in AI data centers, while Chinese companies have already placed significant orders for these advanced chips.
Nvidia has introduced DGX Cloud Lepton, a service that expands access to its AI chips across various cloud platforms, targeting artificial intelligence developers. This initiative aims to connect users with Nvidia's network of cloud providers, enhancing the availability of its graphics processing units (GPUs) beyond major players in the market.
Oracle plans to spend approximately $40 billion on high-performance Nvidia chips to support OpenAI's new data center in Abilene, Texas. This initiative is part of the U.S. Stargate Project, aimed at enhancing the country's position in the competitive AI industry. The purchase will involve around 400,000 of Nvidia's GB200 chips, which Oracle will lease to OpenAI.
The article discusses a critical vulnerability identified in NVIDIA's software, designated CVE-2025-23266, which poses significant risks to AI systems using NVIDIA hardware. It highlights the implications of this vulnerability, potential exploits, and the necessity for immediate patching by users to safeguard their systems.
Keith Heyde, newly appointed head of infrastructure at OpenAI, is leading the search for sites to build the company’s next-generation data centers, aimed at supporting the training of advanced AI models. With around 800 proposals received, about 20 sites are in advanced review, focusing on factors like power access and community support rather than just tax incentives. OpenAI's ambitious expansion includes a significant partnership with Nvidia, which is investing up to $100 billion to support the infrastructure needed for AI development.
NVIDIA has introduced a new AI blueprint that facilitates the integration between Blender and AI image generation tools, enhancing the workflow for 3D artists. This development aims to streamline the creative process, allowing users to leverage AI capabilities directly within their 3D modeling environment.
NVIDIA's Nemotron-H-8B-Base-8K is a large language model designed for text completion, featuring a hybrid architecture and a context length of 8K. It supports multiple languages and offers customization tools through the NeMo Framework for enhanced performance in research and development. The model is intended for use on NVIDIA GPU-accelerated systems and is part of the Nemotron-H collection, governed by specific licensing terms.
Nvidia has reported record sales driven by the ongoing AI boom, reflecting strong demand for its graphics processing units (GPUs) and other AI-related products. The company's financial performance highlights its pivotal role in the rapidly growing artificial intelligence sector.
The U.S. government has announced new restrictions on the export of artificial intelligence chips from companies like Nvidia and AMD to China, aiming to hinder the country's advancements in AI technology. This move reflects a broader strategy by the Trump administration to combat China's growing capabilities in the tech sector.
Nvidia has introduced an AI-driven model that simulates Earth's climate with unprecedented detail, allowing researchers to make predictions at a five-kilometer resolution. This advancement raises questions about the potential applications and implications of such powerful technology in climate science and beyond.
The NVIDIA HGX B200, now available in the Cirrascale AI Innovation Cloud, offers an 8-GPU configuration that significantly enhances AI performance, achieving up to 15X faster inference compared to the previous generation. With advanced features such as the second-generation Transformer Engine and NVLink interconnect, it is designed for demanding AI and HPC workloads, ensuring efficient scalability and lower operational costs.
Nvidia has launched the DGX Spark, a $4,000 desktop AI computer that offers one petaflop of performance and 128GB of memory in a compact design, aimed at facilitating local AI model development. Available for order starting October 15, the DGX Spark targets AI developers who require more memory capacity than standard PCs can provide, enabling the use of larger models without relying on cloud services.
Dell Pro Max, in collaboration with NVIDIA, is revolutionizing workflows across various industries by integrating advanced AI technologies. The podcast series "Reshaping Workflows" explores the impact of AI, digital twins, and edge computing on architecture, engineering, and creative fields, showcasing innovative applications and insights from industry leaders.
NVIDIA CEO Jensen Huang predicts that advancements in artificial intelligence will ultimately lead to increased workloads for individuals rather than reducing them. He emphasizes that while AI can automate certain tasks, it will also create new responsibilities and complexities, making people busier in the future.
The U.S. government has imposed a fee on exports of Nvidia's H20 chip and AMD's MI308 to China, both significant for AI applications. Nvidia has indicated the export restrictions previously cost it $4.5 billion in a single quarter, while demand for the H20 chip in China remains high. AMD has not yet commented on the situation.
Alibaba and Nvidia are expanding their partnership to enhance artificial intelligence capabilities, focusing on cloud computing and data processing. This collaboration aims to leverage Nvidia's advanced AI technologies within Alibaba's cloud services, potentially transforming various sectors in China and beyond.
Nvidia is working on a new AI chip built on its Blackwell architecture, aimed at outperforming its current H20 model available in China. Although U.S. President Trump has hinted at the possibility of allowing the sale of more advanced chips to China, regulatory approval remains uncertain due to security concerns. Samples of the new chip are expected to be delivered to Chinese clients as early as next month.
Nvidia has launched its Jetson AGX Thor robotics chip module, priced at $3,499 for developers, aimed at enabling companies to create advanced robots. The chips, which are 7.5 times faster than previous models and equipped with 128GB of memory, are part of Nvidia's strategy to capitalize on the growing robotics market, although it currently represents only 1% of the company's revenue. Major companies like Amazon and Boston Dynamics are already utilizing these chips for their robotic applications.
Two individuals have been arrested for attempting to smuggle AI chips from the U.S. to China, which raises concerns about national security and technology export regulations. Meanwhile, Nvidia has reiterated its stance against implementing kill switches for its products, emphasizing the importance of maintaining technological access.
NVIDIA has introduced a new AI pipeline aimed at revolutionizing the prototyping process for 3D artists, significantly reducing the time and effort needed for creating 3D models. This innovation could streamline workflows and enhance creativity in the design process.
Nvidia has made history by becoming the first company to reach a market value of $4 trillion, surpassing competitors Apple and Microsoft. Originally focused on enhancing personal computer graphics, Nvidia's rapid growth is largely attributed to its pivotal role in the AI boom, particularly in gaming and data centers.
Amazon has launched the EC2 P6e-GB200 UltraServers, featuring NVIDIA Grace Blackwell GPUs that offer exceptional performance for AI training and inference. These UltraServers can deliver up to 360 petaflops of FP8 compute and are designed for intensive AI workloads, supporting seamless integration with various AWS services. They are currently available in the Dallas Local Zone through EC2 Capacity Blocks for ML.
NVIDIA has released its powerful AI facial animation tool, allowing creators to generate realistic facial animations with ease. This tool is now accessible to everyone, enhancing the capabilities of artists and developers in various fields including gaming and film.
NVIDIA has unveiled its first Blackwell wafer manufactured in the US, marking a significant milestone in domestic chip production at TSMC's facility in Arizona. This advancement supports NVIDIA's aim to revolutionize the AI industry while reducing costs and energy consumption, and helps mitigate risks associated with tariffs and geopolitical tensions. The company plans to invest heavily in expanding AI infrastructure in the US.
Nvidia has acquired Enfabrica CEO Rochan Sankar and its technology for over $900 million, aiming to enhance its AI capabilities by connecting more than 100,000 GPUs. This move reflects a trend among tech giants to acquire AI talent through substantial investments rather than traditional acquisitions. Nvidia's latest investments also include a $5 billion stake in Intel for collaboration on AI processors.
The article discusses the evolution of NVIDIA's Tensor Core technology, tracing its development from the Volta architecture to the upcoming Blackwell architecture. It highlights key advancements in performance and capability, emphasizing how these improvements address the growing demands of AI and machine learning applications. The analysis provides insights into the implications of these technological changes for future computing tasks.
Nvidia is investing $1 billion for a 2.9% stake in Nokia, aiming to collaborate on artificial intelligence networking solutions and data centers. This partnership has driven Nokia's shares to their highest level in nearly a decade, with expectations for revenue contributions starting from 2027.
Nvidia has introduced a new GPU specifically designed for long context inference, aimed at enhancing performance in AI applications that require processing extensive data sequences. This innovation promises to improve efficiency and effectiveness in complex tasks, catering to the growing demands of AI technologies.
Eli Lilly has partnered with Nvidia to develop a powerful supercomputer aimed at accelerating drug discovery. The supercomputer will leverage AI capabilities to identify new molecules and reduce the lengthy drug development process, all while running on renewable energy within Lilly's facilities.
Alibaba Cloud has developed a new pooling system called Aegaeon that significantly reduces the number of Nvidia GPUs needed for serving large language models, achieving an 82% reduction during beta testing. This innovative system allows for better GPU utilization by virtualizing access at the token level, enabling multiple models to be served simultaneously and increasing output efficiency. The findings suggest potential advancements for cloud providers in managing GPU resources, particularly in constrained markets like China.