Embulk is an Open-source Pluggable Bulk Data Loader
to/from varieties of storages, file formats, databases, cloud services, and else.

Latest Versions

Development (unstable)

Look into this article on the stable and development versions.

Recent Articles

Embulk is a bulk data loader. It helps data transfer between types of databases, storages, file formats, cloud services, and else.

Embulk supports:

  • Combination of input and output from varieties of plugins
  • Plugins released in Maven and Ruby gem repositories
  • Automated guess of input file formats
  • Parallel execution to deal with big data sets
  • Transaction control to guarantee all-or-nothing


For Users
User Forum: GitHub Discussions
For Developers
Chat: Zulip at https://embulk-dev.zulipchat.com/
Developer Resources: https://dev.embulk.org/

Quick Start

Embulk runs on Java. Make sure Java Runtime is installed.

Embulk v0.9 and v0.10 run on Java 8. Java 9 is not supported officially.

Linux & macOS

curl --create-dirs -o ~/.embulk/bin/embulk -L "https://dl.embulk.org/embulk-latest.jar"
chmod +x ~/.embulk/bin/embulk
echo 'export PATH="$HOME/.embulk/bin:$PATH"' >> ~/.bashrc
source ~/.bashrc


PowerShell -Command "& {[Net.ServicePointManager]::SecurityProtocol = [Net.SecurityProtocolType]::TLS12; Invoke-WebRequest http://dl.embulk.org/embulk-latest.jar -OutFile embulk.bat}"

Next steps

The embulk example command generates an example CSV file.

You can try Embulk quickly by:

embulk example ./try1
embulk guess ./try1/seed.yml -o config.yml
embulk preview config.yml
embulk run config.yml