Corsair

Click to visit website
About
d-Matrix has developed Corsair, an AI inference platform for datacenters. Corsair boasts impressive performance, achieving 60,000 tokens/sec at 1ms/token latency for Llama3 8B and 30,000 tokens/sec at 2ms/token latency for Llama3 70B, all within a single server or rack, respectively. The platform is designed for speed, efficiency, sustainability, and scalability, addressing the growing need for performant and cost-effective Generative AI solutions. d-Matrix is backed by significant funding and comprises a team of experienced industry veterans. The company's focus is on making large-scale AI inference commercially viable and sustainable. They offer various job opportunities across several locations.
Platform
Task
Features
• energy efficient
• scales with model size
• 30,000 tokens/sec at 2ms/token latency for llama3 70b in a single rack
• 60,000 tokens/sec at 1ms/token latency for llama3 8b in a single server
• ultra low latency without compromising throughput
Job Opportunities
Technical Recruiter, Senior Staff G&A Recruiting
Corsair is a high-performance, energy-efficient AI inference platform designed for datacenters, offering blazing fast speeds and commercial viability.
Benefits:
Competitive pay and equity
Health care
Flexible time-off
Paid paternity leave
401k retirement plan
Show more details
Machine Learning Computer Architect, Staff - Workload Analysis
Corsair is a high-performance, energy-efficient AI inference platform designed for datacenters, offering blazing fast speeds and commercial viability.
Benefits:
Competitive pay and equity
Health care
Flexible time-off
Paid paternity leave
401k retirement plan
Show more details
AI Security Architect, Senior Staff
Corsair is a high-performance, energy-efficient AI inference platform designed for datacenters, offering blazing fast speeds and commercial viability.
Benefits:
Competitive pay and equity
Health care
Flexible time-off
Paid paternity leave
401k retirement plan
Show more details
Ratings & Reviews
No ratings available yet. Be the first to rate this tool!
Alternatives
Modular MAX
Modular's MAX is a free, open-source AI inference framework, complemented by the high-performance Mojo programming language. Enterprise support is also available.
View DetailsClarifai
Clarifai is the fastest AI inference and reasoning platform on GPUs, offering unmatched speed, significant cost reduction, and effortless scaling for AI models.
View Detailsailia AI Series
ailia AI Series is a world-class AI inference engine and SDK, developed with semiconductor expertise, offering cross-platform support for consistent AI development.
View DetailsBlumind
Blumind offers all-analog AI solutions for low-power, low-latency edge computing, targeting applications like voice UI, sensor analysis, and visual trigger detection across diverse industries.
View DetailsFriendliAI
FriendliAI provides a fast, cost-effective platform for deploying and managing generative AI models, including fine-tuning and monitoring capabilities.
View DetailsFuriosaAI
FuriosaAI designs and builds powerfully efficient AI accelerators and NPUs for enterprise and cloud AI inference, focusing on sustainable AI computing.
View DetailsMythic
Mythic provides power-efficient, high-performance analog computing solutions for AI inference applications across various sectors.
View DetailsUntether AI
Untether AI provides high-performance, energy-efficient AI inference accelerators for various industries, from cloud to edge deployments.
View DetailsAvian API
Avian is a high-performance AI inference platform offering industry-leading speeds for deploying and running large language models like DeepSeek R1 and HuggingFace LLMs.
View DetailsFeatured Tools
adly.news
adly.news is a 100% free newsletter advertising marketplace connecting businesses with engaged newsletter audiences, offering automated payouts and secure payments.
View DetailsEveryDev.ai
EveryDev.ai is a comprehensive community platform and directory for AI developers, offering a curated feed of tools, builds, news, and discussions for people shipping AI projects.
View DetailsWhisk AI Image Generator
Whisk AI Image Generator is a Google Labs-Powered Image Remix Platform that blends visual inputs (subject, scene, style) to create stunning 4K artwork quickly.
View DetailsAPIPASS
APIPASS is a unified marketplace for discovering, integrating, and managing thousands of APIs, providing developers with fast, reliable, and cost-effective access to leading AI models.
View DetailsVO4 AI
VO4 AI is the best AI video maker that turns your ideas into stunning videos. Make professional videos from text or images with our smart AI technology.
View DetailsSeedream 5.0
Seedream 5.0 is an online AI image generation platform powered by Bytedance Seedream 5.0 and Seedream V5, transforming text descriptions into stunning 4K visuals instantly.
View DetailsSeedream 5.0 Generator & Edit Studio
Seedream 5.0 is a lightning-fast AI Image Generator and editor powered by ByteDance Seedream 5.0, offering text-to-image creation, natural language editing, and 4K resolution output.
View DetailsKaomojiya
Kaomojiya is Japan's largest kaomoji collection site. It offers thousands of expressive kaomoji categorized for easy one-click copying and usage across all platforms.
View DetailsVO4 AI
VO4 AI is a professional AI video generator studio utilizing the VO4 Model to create stunning, cinematic 1080p videos from text prompts or static images.
View Details