Datastrato

About
Datastrato is a platform built on Apache Gravitino, an open-source, high-performance, geo-distributed, and federated metadata lake. It offers features for managing data and AI assets, querying across lakehouse formats, connecting to various data engines, managing unstructured data and models, and enforcing governance across multiple regions and cloud providers. It provides a single control plane for access control and data virtualization for a unified experience across remote regions. It supports multi-cloud environments and offers flexible deployment options.
Features
• role-based access control (RBAC)
• single sign-on
• metadata management for various data types
• enterprise-ready connectors
• lakehouse federation
• Python client
• permission management system and RBAC
• governance across multiple regions and cloud providers
Job Opportunities
Distributed Data Systems - Senior Software Engineer
Datastrato is an open-source data and AI governance platform built on Apache Gravitino. It helps manage data lakes and AI assets, and it enforces governance across various cloud providers.
Benefits:
Generous equity options package
Flexible working hours
Freedom of choice for your technical equipment
Wonderful, highly qualified colleagues
Truly international: many different nationalities
A transparent, collaborative & inclusive culture
Exciting opportunities for career progression as we grow
Little to zero controls combined with autonomous work where you set your own pace in a collaborative environment
Education Requirements:
BS in Computer Science, related technical field, or equivalent practical experience.
Optional: MS or PhD in databases or distributed systems
Experience Requirements:
At least 5 years of experience in software engineering with a focus on distributed data systems
Other Requirements:
Proficiency in programming languages such as Java/Scala, C++, or Rust.
Experience with distributed systems, databases and big data systems (Spark, Hadoop, and others)
Responsibilities:
Design and develop distributed data systems from the ground up, including areas such as geo-distributed consensus systems and high-availability distributed systems
Collaborate with other team members to identify and solve complex distributed, performance, and engineering problems
Mentor and provide guidance to junior engineers
Distributed Data Systems - Architect
Benefits:
Generous equity options package
Flexible working hours
Freedom of choice for your technical equipment
Wonderful, highly qualified colleagues
Truly international: many different nationalities
A transparent, collaborative & inclusive culture
Exciting opportunities for career progression as we grow
Little to zero controls combined with autonomous work where you set your own pace in a collaborative environment
Education Requirements:
BS in Computer Science, related technical field, or equivalent practical experience.
Optional: MS or PhD in databases or distributed systems
Experience Requirements:
At least 8 years of experience in software engineering with a focus on distributed data systems
Other Requirements:
Proficiency in programming languages such as Java/Scala, C++, or Rust
Responsibilities:
Design and develop distributed data systems from the ground up, including areas such as geo-distributed consensus systems and high-availability distributed systems
Collaborate with other team members to identify and solve complex distributed, performance, and engineering problems
Mentor and provide guidance to junior engineers
Database Engine Internals - Senior Software Engineer
Benefits:
Generous equity options package
Flexible working hours
Freedom of choice for your technical equipment
Wonderful, highly qualified colleagues
Truly international: many different nationalities
A transparent, collaborative & inclusive culture
Exciting opportunities for career progression as we grow
Little to zero controls combined with autonomous work where you set your own pace in a collaborative environment
Education Requirements:
BS in Computer Science, related technical field, or equivalent practical experience.
Optional: MS or PhD in databases or related field
Experience Requirements:
At least 5 years of experience in software engineering with a focus on database engine internals
Other Requirements:
Proficiency in programming languages such as C++, Rust and Java
Experience with database internals, query processing, and optimization
Responsibilities:
Design and implement a database engine from the ground up, including areas such as query compilation and optimization, distributed query execution and scheduling, and efficient storage structures for data and metadata
Collaborate with other team members to identify and solve complex compilation, performance, and engineering problems
Mentor and provide guidance to junior engineers
Alternatives
OneTrust
OneTrust automates privacy, security, and governance to build trust and manage risk. It offers solutions for consent, privacy, third-party risk, tech risk, and AI governance.
basebox
basebox is a secure AI management system that allows companies to control AI usage within their infrastructure, ensuring data security and process control. It supports various use cases with flexible deployment options.
KADA
KADA is an intelligent copilot for building data trust across your organisation. It combines data observability & governance to provide data discovery, lineage, monitoring and more.
Relyance AI
Relyance AI is a platform for privacy, security, and AI governance, providing data mapping, risk assessment, and compliance management.
Spawning AI
Spawning provides data governance tools for generative AI, including a Do Not Train Registry and tools for rights holders and AI developers to manage data preferences.