Indexify

About

Indexify is an open-source data framework designed for effortless ingestion and extraction of unstructured data at any scale for LLMs. It features a real-time extraction engine, pre-built extractors for various data types (documents, presentations, videos, audio), and supports custom extractor creation. Data retrieval is facilitated by semantic search and SQL querying. Indexify scales from local runtimes to large-scale Kubernetes deployments across multiple clouds. It also provides end-to-end observability and monitoring of ingestion, extraction, and retrieval processes.

Platform: Web
Task: data extraction

Features

semantic search

multi-modal support

runs on laptops and across large-scale deployments (Kubernetes, VMs, bare metal)

SQL querying

custom extractor creation using the SDK

reliable extraction for unstructured data (documents, presentations, videos, audio)

pre-built extraction adapters

real-time extraction engine

Job Opportunities

Founding Applied AI Scientist

Indexify is an open-source, real-time data extraction framework for LLMs, supporting various data types and scalable deployments.

science · onsite · San Francisco, US · full-time

Benefits:

  • 401(k) plans

  • Comprehensive Healthcare and Dental Benefits

Education Requirements:

  • Ph.D. or Bachelor's degree in a quantitative field such as Computer Science or Mathematics, or equivalent industry experience

Experience Requirements:

  • 4+ years of experience working with AI/ML models, specifically in the fields of document understanding, computer vision, and multi-modal learning

  • Proven expertise in training and evaluating models for complex document extraction

  • Deep NLP Expertise

  • OCR Integration

  • Model Pretraining and Fine-tuning

Other Requirements:

  • Solid programming skills in Python and proficiency in at least one deep learning framework (e.g., TensorFlow, PyTorch)

  • Layout Analysis

  • Benchmarking and Evaluation

  • Vision-Language Models

Responsibilities:

  • Design, train, and evaluate document understanding models for extracting complex data

  • Develop and optimize multi-modal visual Q&A models

  • Collaborate with the team to integrate AI-driven features into Tensorlake’s platform

  • Work closely with users and customers to understand their needs

Founding Backend Engineer

Benefits:

  • 401(k) plans

  • Comprehensive Healthcare and Dental Benefits

Education Requirements:

  • Ph.D. or Bachelor's degree in Math, Computer Science, or other quantitative fields, OR equivalent experience

Experience Requirements:

  • 7+ years of relevant work experience

  • Experience in building large-scale distributed systems

Other Requirements:

  • Knowledge of systems programming languages such as Rust, Go, C++, or C

  • Designing observable systems that operate at internet scale

  • Deep knowledge of operating and using cluster schedulers

Responsibilities:

  • Design and implement a distributed control plane for operating Indexify on public clouds

  • Design and implement workflows for cluster operations and bootstrapping in VPCs

  • Focus on long-term operability of the system and services

  • Work closely with the Founder on the company's technical direction and platform

  • Work closely with our users to learn the impact of our product and improve their experience

Founding Product Engineer

engineering · hybrid · San Francisco, US · $150,000 - $210,000 · full-time

Benefits:

  • Healthcare, Dental and Vision Insurance

  • 401(k) plans

  • 5 weeks of PTO

Experience Requirements:

  • At least 7 years of front-end or full-stack development experience

  • Familiarity with technologies such as Python, React, TypeScript, FastAPI, or SQLAlchemy

Other Requirements:

  • Motivated people who are excited to build tools to power the next generation of cloud applications

  • Passionate about working adjacent to users and the product

Responsibilities:

  • Develop delightful UIs or high-quality backend business logic that empowers software developers and simplifies programming

  • Work with a team of leading distributed systems and machine learning experts

  • Communicate your work to a broader audience through talks, tutorials, and blog posts

  • Help us build and shape a world-class company

Alternatives

LiftData

LiftData provides real-time AI-powered data extraction from various content sources using a decentralized, scalable platform.

Gilio

Gilio processes documents with AI, extracting and transforming information for automation. It integrates with various systems via API and offers features such as data validation, document digitization, and workflow automation.

PDFMerse

PDFMerse is an AI-powered tool that transforms PDFs into structured data, offering automated extraction, enhanced accuracy, and versatile output formats.

Map Lead Scraper

Map Lead Scraper is a Google Maps scraping tool that extracts local business data and contacts, saving hours of manual searches for lead generation.

InstantAPI.ai

InstantAPI.ai is an AI web scraping API that extracts clean data from any webpage without requiring selectors, CAPTCHA handling, or extensive maintenance.
