January 17, 2022: DataStation 0.6.0 is released! Read more
DataStation The Open-Source Data IDE

SQL IDE meets Computational Notebook.

  • search Load from databases, REST APIs and files
  • code Script in your favorite language
  • bar_chart Create beautiful visualizations
  • lock Data never leaves your machine
  • favorite Desktop app on Windows, macOS, or Linux

Turn scripts into dashboards and exports.

  • bar_chart Create reports on your data, wherever it is
  • schedule Schedule recurring exports of reports
  • code Customize everything through scripts
  • groups Work on projects as a team
  • favorite It's easy! No ETL processes required

Supported ____

Databases

  • SQLite
  • PostgreSQL
  • MySQL
  • SQL Server
  • Oracle
  • ClickHouse
  • Snowflake
  • Elasticsearch
  • InfluxDB
  • Prometheus

Languages

  • Python
  • Node.js
  • Ruby
  • R
  • Julia

File Types

  • Excel
  • CSV
  • JSON
  • JSON Newline
  • Parquet
  • Apache Access Logs
  • Apache Error Logs
  • NGINX Access Logs
  • Custom Regex

Why use DataStation over ____

a SQL GUI?

  • Query non-SQL databases
  • Query REST servers
  • Import files from disk
  • Script against results in your favorite scripting language
  • Host dashboards (in Server mode)
  • Recurring email exports (in Server mode)

Jupyter?

  • Easy installation. No server needed (unless you want it!)
  • Built-in integrations with databases, REST servers, files local or over SSH
  • Focus on the business logic of your reports, not connection code

custom scripts?

  • No need to look up how to connect to your database
  • No need to look up how to generate graphs
  • No need to figure out how to host every new report
  • No need to figure out how to set up recurring email exports of reports
  • Focus on the business logic of your reports

The Pitch

DataStation is built by Multiprocess Labs, run by myself, Phil Eaton. As a developer and engineering manager at Oracle and elsewhere, I found myself repeatedly creating custom scripts to pull data from multiple sources to produce useful reports for product and business teams. It always took me a long time to remember how to connect to all the different databases, how to graph results, how to host a new site for the report, how to send recurring exports of the report.

One way to solve this is to make sure all your data goes into a data warehouse. Then you can just use the data warehouse's dashboarding system to do all your work. But this requires entire organization buy-in both up front and for every new data source you want to add. And the warehouse software and ETL maintenance can be cost prohibitive for small organizations.

In contrast, DataStation gives you a smart, open-source data IDE where you can easily combine data using your existing credentials or API keys with no new ETL process. For truly massive reports, this may fall over. But the majority of business reports can be run without pulling terabytes (let alone gigabytes) of data onto a single machine. DataStation helps you out with those.

DataStation can be run as an easy-to-install desktop application on a single laptop. Or it can be run as a server instance where all data and scripts can be shared across teams. Either way, your credentials are always encrypted.

Phil phil@multiprocess.io

Subscribe to stay in touch!