Links
OpenPCC is an open-source framework that enables private AI inference without revealing user data. It supports custom AI models and uses encrypted streaming and Oblivious HTTP to maintain user privacy. The project aims to establish a community-driven standard for AI data privacy.
The article describes how companies across several industries are using NVIDIA's Blackwell platform to significantly lower AI token costs. By combining open-source models with optimized infrastructure, businesses in healthcare, gaming, and customer service have cut inference costs considerably and improved performance.
InferenceMAX™ is an open-source automated benchmarking tool that continuously evaluates the performance of popular inference frameworks and models to ensure benchmarks remain relevant amidst rapid software improvements. The platform, supported by major industry players, provides real-time insights into inference performance and is seeking engineers to expand its capabilities.