Nexa AI

About
Nexa AI is a development platform specializing in small multimodal models and accelerated edge inference, optimized for any device. It lets developers build high-performance on-device AI apps without handling model compression or edge deployment themselves. Nexa AI supports state-of-the-art models from DeepSeek, Llama, Gemma, and Qwen, as well as Nexa's own Octopus, OmniVLM, and OmniAudio. The company positions its on-device AI expertise as industry-leading, with the stated goal of letting developers deploy optimized, local AI in hours, not months. Platform features include multimodal model support, model compression, and local on-device inference.
Features
• SOTA multimodal models
• Enterprise-grade support
• Accelerated time-to-market
• Deployment on any device
• <1s processing time
• Multimodality optimization
• Model compression and quantization
• Leading on-device AI accuracy
Job Opportunities
AI Research Scientist
Nexa AI: Accelerating Gen-AI tasks on any device. Build high-performance AI apps on-device without the hassle of model compression or edge deployment.
Experience Requirements:
At least one machine learning research project in which you played a major role
Significant experience with Python, machine learning, and research
Familiarity with C or C++
Responsibilities:
Create datasets that target promising language model capabilities, such as function calling and reflection
Build tooling and infrastructure to enable efficient fine-tuning experiments on language models
Help develop new methods or novel fine-tuning techniques to improve language model behaviors
Run experiments that feed into key AI research
Backend Engineer
Education Requirements:
Minimum BS/MS in Computer Science
Experience Requirements:
2+ years of experience
Knowledge of OS internals, compilers, low-power/mobile optimization
Experience with low-level code in C and frameworks such as CUDA and OpenCL
Proficiency in multithreading and performance optimization
Other Requirements:
Excellent CS fundamentals (data structures, algorithms, coding)
Responsibilities:
Write stable, testable infrastructure
Develop our suite of SDKs on both Android and iOS
Diagnose and fix bugs and performance issues
Deep Learning Engineer
Education Requirements:
Minimum BS/MS in Computer Science
Experience Requirements:
2+ years of professional experience
Knowledge of operating system internals, compilers, and low-power/mobile optimization
Experience with low-level programming in C and frameworks such as CUDA and OpenCL
Proficiency in multithreading and performance optimization
Other Requirements:
Excellent understanding of computer science fundamentals, including data structures, algorithms, and coding
Responsibilities:
Specialize in Google Cloud / AWS tech stacks
Familiarity with LLM technologies, particularly with the Transformers library
Experience with model compression is a plus
Knowledge of model deployment on edge devices is a plus
Contribute to the development of our SDKs across multiple platforms, including Android, iOS, and Linux
Alternatives
Emergent
Emergent is a platform that empowers users to build ambitious applications with AI, providing tools and resources for AI-powered development.
Rainmakers
Rainmakers is a company specializing in technology and AI development, offering services from AI/ML development to consulting and marketing.
Artkai
Artkai is a customer-centric digital product development agency offering end-to-end software development, product design, and modernization services for enterprises.
Vector Labs
Vector Labs is an AI consulting and engineering firm that provides expertise in AI, machine learning, data analytics, and custom software development to help businesses grow.
T-Bank AI Center
T-Bank AI Center develops AI technologies and products, conducts research, and offers educational programs in AI.