Enterprise AI Data Agents
From day one, Genesis Data Agents come ready with advanced skills and tools to securely automate data workflows so you can focus on high-value impact.
An open source, unified model and set of language-specific SDKs for defining and executing data processing workflows
Apache Beam is an open source, unified model and set of language-specific SDKs for defining and executing data processing workflows, and also data ingestion and integration flows, supporting Enterprise Integration Patterns (EIPs) and Domain Specific Languages (DSLs). Dataflow pipelines simplify the mechanics of large-scale batch and streaming data processing and can run on a number of runtimes like Apache Flink, Apache Spark, and Google Cloud Dataflow (a cloud service). Beam also brings DSL in different languages, allowing users to easily implement their data integration processes.
Data engineering without the complexity
Ship reliable data pipelines with built-in testing, documentation, and monitoring.
lakeFS brings software engineering best practices and applies them to data
lakeFS provides version control over the data lake, and uses Git-like semantics to create and access those versions. If you know git, you'll be right at home with lakeFS.With lakeFS, you can apply concepts to your data lake such as branching to create an isolated version of the data, committing to create a reproducible point in time, and merging in order to incorporate your changes in one atomic action.
The best dashboards are built with code.
Observable is the modern platform for developing and hosting powerful, performant, polyglot data products built on open source.
Create a data-driven culture with BI for all
Turn your data into visuals with advanced data-analysis tools, AI capabilities, and a user-friendly report-creation tool.
TOTEM explores time series unification through discrete tokens (not patches!!). Its simple VQVAE backbone learns a self-supervised, discrete, codebook in either a generalist (multiple domains) or specialist (1 domain) manner. TOTEMs codebook can then be tested on in domain or zero shot data with many 🔥 time series tasks.
Account Intelligence for Vertical Software
Uncover industry-specific data that will drive your next campaign.
A faster way to build and share data apps
Turn your data scripts into shareable web apps in minutes.All in pure Python. No front‑end experience required.
A new type of shell
Nu pipelines use structured data so you can safely select, filter, and sort the same way every time. Stop parsing strings and start solving problems.
a next-generation Python notebook
Explore data and build apps seamlessly with marimo, a next-generation Python notebook.
Amperity
Use AI to build and activate customer profiles in your data lakehouse — faster, smarter, and with much less work
Anaconda
Anaconda is the birthplace of Python data science. We are a movement of data scientists, data-driven enterprises, and open source communities.
The Best Place to Run Apache Airflow®
Take Apache Airflow® to the next level with Astro. From AI and Large Language Models to data-driven applications, Astronomer delivers reliability at any scale and accelerates innovation.
Find, Trust, and Govern AI-Ready Data
Provide trusted data with less effort and more adoption
Moving data. Powering innovation.
Effortlessly centralize all the data you need so your team can deliver better insights, faster. Start for free.
From data warehouse to a unified, AI-ready data platform
BigQuery is a fully managed, AI-ready data analytics platform that helps you maximize value from your data and is designed to be multi-engine, multi-format, and multi-cloud.
How the world collects public web data
Award winning proxy networks, powerful web scrapers, and ready-to-use datasets for download. Welcome to the world's #1 web data platform.
Bring everyone together with data.
From quick queries, to deep-dive analyses, to beautiful interactive data apps – all in one collaborative, AI-powered workspace.
Browser Automation and Dodge Bot Detectors
Scrape and automate any site. Scale your 1st party automations with our Browsers as a Service. Get past even the toughest detectors with our next-gen tech, BrowseQL.
Business Intelligence and Analytics Software
Vizualize analytics like you’ve never seen before.
Open Source AI Agents for Data Analysis
Ask questions to your enterprise data in natural language. Get real time data insights.
Modern monitoring & security
See metrics from all of your apps, tools & services in one place with Datadog’s cloud monitoring as a service solution. Try it for free.
Cyera
The data security solution you've been waiting for. Cyera enables you to discover and classify data, protect it from exposure, and maintain a resilient posture.
Data Agents
The DIY AI for Data Do it yourself doesn’t mean do it alone. Numbers Station’s agents are your partners in building AI-native data applications—smarter and faster. Prompt Box Search agent is thinking I...
Beautiful data visualizations to stunning data apps with AI
Discover data applications for production with Plotly Dash. Put data and AI into action with scalable, interactive data apps for your organization.
Own them all on the new data intelligence platform
Databricks brings AI to your data to help you bring AI to the world.
Datagrid
Agentic AI that runs on your data and systems, designed to enhance productivity and streamline workflows. Ensuring tasks are completed efficiently.
Datawrapper
Create interactive, responsive & beautiful data visualizations with the online tool Datawrapper — no code required.
A query engine that runs at ludicrous speed
Fast distributed SQL query engine for big data analytics that helps you explore your data universe.
Flourish Studio
Bring data to life with Flourish. Create data visualizations and interactive content – no coding needed. Engage, inspire, and tell your best data stories with ease.
Better Insights. Faster.
Heap is the only digital insights platform that shows everything users do on your site, revealing the "unknown unknowns" that stay invisible with other tools.
ETL, Data Integration & Data Pipeline Platform
Fully managed, no-code platform to automate data replication. Ingest, transform, and load data from 150+ sources. Get reliable data every time.
High-performance streaming data platform
Redpanda is a powerful, simple, and cost-efficient streaming data platform that is compatible with Kafka® APIs while eliminating Kafka complexity.
The standard for Identifying, Controlling, and Correcting Critical Data
The All-in-One Tool advancing data quality, consistency, and governance with simplicity and precision.
Build World Class AI Datasets. Together.
Manage your AI data using Oxen's state of the art data version control. Blazing fast, and Open source.
Website Heatmaps & Behavior Analytics Tools
The next best thing to sitting beside someone browsing your site. See where they click, ask what they think, and learn why they drop off. Get started for free.
Prioritize with confidence
Free Kano powered surveys and analyses to help you build your product roadmap. Get insights in what your customers value the most with a survey you can share in seconds.
Label & Curate Multimodal Data for AI
Manage, curate, and label multimodal data such as image, video, audio, document, text and DICOM files – all on one platform. Transform petabytes of unstructured data into high quality data for training, fine-tuning, and aligning AI models, fast.
Labelbox
Labelbox delivers the software and services to help you build, operate, or staff your data factory
Low-code programming for event-driven applications
The easiest way to collect, transform and visualize real-time data.
Data infrastructure, scaled for success
Pipekit is your partner in data infrastructure and scaling for data science, AI, and ML. We help teams go from notebooks to models serving billions of users. Build for success with Pipekit.
Business Intelligence built around data teams
Mode is a collaborative data platform that combines SQL, R, Python, and visual analytics in one place. Connect, analyze, and share, faster.
Your platform for AI and data pipelines.
Dagster is a unified control plane for teams to build, scale, and observe their AI & data pipelines with confidence.
News API – Search News and Blog Articles on the Web
Locate articles and breaking news headlines from news sources and blogs across the web with our JSON API
Oxen.ai
Manage your AI data using Oxen's state of the art data version control. Blazing fast, and Open source.
DataFrames for the new era
Polars is an open-source library for data manipulation, known for being one of the fastest data processing solutions on a single machine. It features a well-structured, typed API that is both expressive and easy to use.
Free software, open standards, and web services for interactive computing across all programming languages
The Jupyter Notebook is a web-based interactive computing platform. The notebook combines live code, equations, narrative text, visualizations, interactive dashboards and other media.
Simplify working and interacting with databases
Ship production apps at lightning speed, and scale to a global audience effortlessly with our next-gen serverless database.
The #1 Enterprise Data Platform for AI
We make enterprise data intelligent and responsive for AI. Build AI capabilities that can reason over enterprise data.
Make content your competitive advantage
Expect more from your CMS: Treat content as data—actionable, scalable, and ready to drive your business forward with Sanity Content Operating System.
ThoughtSpot
Transform insights into action with the ThoughtSpot Agentic Analytics Platform—AI agents, automated insights, and embedded intelligence.
Wayfare AI
The Enterprise DataOS - enabling data driven digitalisation and innovation in enterprises.