Corsair

About
d-Matrix has developed Corsair, an AI inference platform for datacenters. d-Matrix reports that Corsair achieves 60,000 tokens/sec at 1 ms/token latency for Llama3 8B in a single server, and 30,000 tokens/sec at 2 ms/token latency for Llama3 70B in a single rack. The platform is designed for speed, efficiency, sustainability, and scalability, addressing the growing need for performant and cost-effective generative AI inference. d-Matrix is backed by significant funding and staffed by experienced industry veterans. The company's focus is on making large-scale AI inference commercially viable and sustainable. It lists job openings across several locations.
Features
• energy efficient
• scales with model size
• 30,000 tokens/sec at 2 ms/token latency for Llama3 70B in a single rack
• 60,000 tokens/sec at 1 ms/token latency for Llama3 8B in a single server
• ultra low latency without compromising throughput
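The throughput and latency figures quoted above can be related with some simple arithmetic. The sketch below assumes, and the listing does not confirm, that the tokens/sec number is aggregate throughput across all requests and that the ms/token number is the per-stream time between tokens. The function name `implied_concurrent_streams` is hypothetical and used only for illustration.

```python
def implied_concurrent_streams(aggregate_tokens_per_sec: float,
                               latency_ms_per_token: float) -> float:
    """Number of parallel streams implied by the quoted figures.

    Assumption (not stated in the listing): each stream emits one token
    every `latency_ms_per_token` milliseconds, and the aggregate rate is
    the sum over all streams.
    """
    per_stream_tokens_per_sec = 1000.0 / latency_ms_per_token
    return aggregate_tokens_per_sec / per_stream_tokens_per_sec


# Figures quoted in the listing for a single server and a single rack.
llama3_8b_streams = implied_concurrent_streams(60_000, 1.0)
llama3_70b_streams = implied_concurrent_streams(30_000, 2.0)

print(f"Llama3 8B:  {llama3_8b_streams:.0f} concurrent streams")
print(f"Llama3 70B: {llama3_70b_streams:.0f} concurrent streams")
```

Under these assumptions both configurations work out to about 60 concurrent streams, which is one way to read the claim of low latency without compromising throughput. A different definition of either metric would change the result.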
Job Opportunities
Technical Recruiter, Senior Staff G&A Recruiting
Corsair is a high-performance, energy-efficient AI inference platform designed for datacenters, offering blazing fast speeds and commercial viability.
Benefits:
Competitive pay and equity
Health care
Flexible time-off
Paid paternity leave
401k retirement plan
Machine Learning Computer Architect, Staff - Workload Analysis
Benefits:
Competitive pay and equity
Health care
Flexible time-off
Paid paternity leave
401k retirement plan
AI Security Architect, Senior Staff
Benefits:
Competitive pay and equity
Health care
Flexible time-off
Paid paternity leave
401k retirement plan
Alternatives
Modular MAX
Modular's MAX is a free, open-source AI inference framework, complemented by the high-performance Mojo programming language. Enterprise support is also available.
Clarifai
Clarifai describes itself as the fastest AI inference and reasoning platform on GPUs, offering high speed, cost reduction, and straightforward scaling for AI models.
Blumind
Blumind offers all-analog AI solutions for low-power, low-latency edge computing, targeting applications like voice UI, sensor analysis, and visual trigger detection across diverse industries.
FriendliAI
FriendliAI provides a fast, cost-effective platform for deploying and managing generative AI models, including fine-tuning and monitoring capabilities.
FuriosaAI
FuriosaAI designs and builds powerfully efficient AI accelerators and NPUs for enterprise and cloud AI inference, focusing on sustainable AI computing.