Corsair

Click to visit website
About
d-Matrix has developed Corsair, an AI inference platform for datacenters. Corsair boasts impressive performance, achieving 60,000 tokens/sec at 1ms/token latency for Llama3 8B and 30,000 tokens/sec at 2ms/token latency for Llama3 70B, all within a single server or rack, respectively. The platform is designed for speed, efficiency, sustainability, and scalability, addressing the growing need for performant and cost-effective Generative AI solutions. d-Matrix is backed by significant funding and comprises a team of experienced industry veterans. The company's focus is on making large-scale AI inference commercially viable and sustainable. They offer various job opportunities across several locations.
Platform
Task
Features
• energy efficient
• scales with model size
• 30,000 tokens/sec at 2ms/token latency for llama3 70b in a single rack
• 60,000 tokens/sec at 1ms/token latency for llama3 8b in a single server
• ultra low latency without compromising throughput
Job Opportunities
Technical Recruiter, Senior Staff G&A Recruiting
Corsair is a high-performance, energy-efficient AI inference platform designed for datacenters, offering blazing fast speeds and commercial viability.
Benefits:
Competitive pay and equity
Health care
Flexible time-off
Paid paternity leave
401k retirement plan
Show more details
Machine Learning Computer Architect, Staff - Workload Analysis
Corsair is a high-performance, energy-efficient AI inference platform designed for datacenters, offering blazing fast speeds and commercial viability.
Benefits:
Competitive pay and equity
Health care
Flexible time-off
Paid paternity leave
401k retirement plan
Show more details
AI Security Architect, Senior Staff
Corsair is a high-performance, energy-efficient AI inference platform designed for datacenters, offering blazing fast speeds and commercial viability.
Benefits:
Competitive pay and equity
Health care
Flexible time-off
Paid paternity leave
401k retirement plan
Show more details
Ratings & Reviews
No ratings available yet. Be the first to rate this tool!
Alternatives
Modular MAX
Modular's MAX is a free, open-source AI inference framework, complemented by the high-performance Mojo programming language. Enterprise support is also available.
View DetailsClarifai
Clarifai is the fastest AI inference and reasoning platform on GPUs, offering unmatched speed, significant cost reduction, and effortless scaling for AI models.
View Detailsailia AI Series
ailia AI Series is a world-class AI inference engine and SDK, developed with semiconductor expertise, offering cross-platform support for consistent AI development.
View DetailsBlumind
Blumind offers all-analog AI solutions for low-power, low-latency edge computing, targeting applications like voice UI, sensor analysis, and visual trigger detection across diverse industries.
View DetailsFriendliAI
FriendliAI provides a fast, cost-effective platform for deploying and managing generative AI models, including fine-tuning and monitoring capabilities.
View DetailsFuriosaAI
FuriosaAI designs and builds powerfully efficient AI accelerators and NPUs for enterprise and cloud AI inference, focusing on sustainable AI computing.
View DetailsMythic
Mythic provides power-efficient, high-performance analog computing solutions for AI inference applications across various sectors.
View DetailsUntether AI
Untether AI provides high-performance, energy-efficient AI inference accelerators for various industries, from cloud to edge deployments.
View DetailsAvian API
Avian is a high-performance AI inference platform offering industry-leading speeds for deploying and running large language models like DeepSeek R1 and HuggingFace LLMs.
View DetailsFeatured Tools
adly.news
adly.news is a free platform that simplifies newsletter advertising, connecting businesses with engaged audiences through ad slots, offering bidding, negotiation, and messaging.
View DetailsAI Dubbing
AI Dubbing is a free AI video dubbing tool that uses advanced AI technology to provide natural, smooth, high-quality dubbing services, supporting 20+ languages and 100+ tones.
View DetailsGemini Watermark Remover
Gemini Watermark Remover is a client-side tool designed to remove hidden SynthID and other embedded watermarks from your AI-generated images, preserving quality.
View DetailsInfatuated.AI
Infatuated.AI is an AI companion platform allowing users to chat, roleplay, and build personalized relationships with AI girlfriends and boyfriends, offering emotional support and secure fantasy sharing.
View DetailsImgGen
ImgGen is the free AI editor that edits photos and turns images into videos in seconds, offering instant creativity all in one place.
View DetailsNano Banana
Nano Banana is a state-of-the-art AI model that revolutionizes text-based image editing and generation with unmatched multi-image fusion and natural language understanding.
View DetailsMacaron
Macaron is the world’s first personal AI agent designed to help you live better by focusing on happiness, health, and freedom, unlike typical productivity tools.
View DetailsVISBOOM
Visboom is the all-in-one AI fashion content creation platform, enabling brands and e-commerce sellers to generate on-model photoshoots and visual assets quickly.
View DetailsBanana AI
Banana AI is an advanced AI photo editor powered by Google’s Nano Banana technology (Gemini 2.5 Flash Image), enabling effortless image editing, restyling, and transformation with simple text prompts.
View DetailstwainGPT
twainGPT is a humanizer that transforms any AI-generated text into undetectable, human-like content, trusted by over 2.3 million users.
View DetailsAI Image Editor
AI Image Editor is a free online tool to edit, transform, and enhance photos with a text prompt, achieving fast, consistent, high-quality results.
View Details