embulk

Embulk is an Open-source Pluggable Bulk Data Loader
to/from varieties of storages, file formats, databases, cloud services, and else.

Latest Versions

Stable
v0.9.25
Development (unstable, but Release Candidate for next stable v0.11)
v0.10.49

Look into this article on the stable and development versions.

Recent Articles


Embulk is a bulk data loader. It helps data transfer between types of databases, storages, file formats, cloud services, and else.

Embulk supports:

  • Combination of input and output from varieties of plugins
  • Plugins released in Maven and Ruby gem repositories
  • Automated guess of input file formats
  • Parallel execution to deal with big data sets
  • Transaction control to guarantee all-or-nothing

Community

For Users
User Forum: GitHub Discussions
For Developers
Chat: Zulip at https://embulk-dev.zulipchat.com/
Developer Resources: https://dev.embulk.org/

Quick Start

Embulk runs on Java. Make sure Java Runtime is installed.

Embulk v0.9 and v0.10 run on Java 8. Java 9 is not supported officially.

Linux & macOS

curl --create-dirs -o ~/.embulk/bin/embulk -L "https://dl.embulk.org/embulk-latest.jar"
chmod +x ~/.embulk/bin/embulk
echo 'export PATH="$HOME/.embulk/bin:$PATH"' >> ~/.bashrc
source ~/.bashrc

Windows

PowerShell -Command "& {[Net.ServicePointManager]::SecurityProtocol = [Net.SecurityProtocolType]::TLS12; Invoke-WebRequest http://dl.embulk.org/embulk-latest.jar -OutFile embulk.bat}"

Next steps

The embulk example command generates an example CSV file.

You can try Embulk quickly by:

embulk example ./try1
embulk guess ./try1/seed.yml -o config.yml
embulk preview config.yml
embulk run config.yml

Presentation