Embulk is an open-source, pluggable bulk data loader to and from a variety of storage systems, file formats, databases, cloud services, and more.
Latest Versions
See this article for details on the stable and development versions.
Embulk is a bulk data loader. It helps transfer data between different types of databases, storage systems, file formats, cloud services, and more.
Embulk supports:
- Combining input and output from a variety of plugins
- Plugins released in Maven and Ruby gem repositories
- Automatic guessing of input file formats
- Parallel execution to handle large data sets
- Transaction control to guarantee all-or-nothing results
Community
- For Users
  - User Forum: GitHub Discussions
- For Developers
  - Chat: Zulip at https://embulk-dev.zulipchat.com/
  - Developer Resources: https://dev.embulk.org/
Quick Start
Embulk runs on Java. Make sure a Java runtime is installed.
Embulk v0.9 and v0.10 run on Java 8. Java 9 is not officially supported.
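Before installing, it can help to confirm which Java version is on your PATH. The following is a minimal sketch, not part of Embulk itself: the helper name java_major_version is hypothetical, and the parsing assumes the usual `java -version` output, where the first line contains a quoted version string such as "1.8.0_292".

```shell
# Hypothetical helper (not part of Embulk): extract the major version
# from a Java version string.
#   "1.8.0_292" -> 8   (legacy 1.x numbering used up to Java 8)
#   "11.0.2"    -> 11
java_major_version() {
  case "$1" in
    1.*) v="${1#1.}"; echo "${v%%[!0-9]*}" ;;  # drop the leading "1.", keep the digits
    *)   echo "${1%%[!0-9]*}" ;;               # keep the leading digits
  esac
}

# java -version writes to stderr; the first line is assumed to look like:
#   openjdk version "1.8.0_292"
if command -v java >/dev/null 2>&1; then
  raw=$(java -version 2>&1 | head -n 1 | cut -d '"' -f 2)
  major=$(java_major_version "$raw")
  if [ "$major" = "8" ]; then
    echo "Java 8 found"
  else
    echo "Java $major found; Embulk v0.9 and v0.10 expect Java 8" >&2
  fi
else
  echo "java not found on PATH" >&2
fi
```

If the check reports a version other than 8, installing a Java 8 runtime and putting it first on PATH is the simplest way to match the supported configuration.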
Linux & macOS
curl --create-dirs -o ~/.embulk/bin/embulk -L "https://dl.embulk.org/embulk-latest.jar"
chmod +x ~/.embulk/bin/embulk
echo 'export PATH="$HOME/.embulk/bin:$PATH"' >> ~/.bashrc
source ~/.bashrc
Windows
PowerShell -Command "& {[Net.ServicePointManager]::SecurityProtocol = [Net.SecurityProtocolType]::TLS12; Invoke-WebRequest https://dl.embulk.org/embulk-latest.jar -OutFile embulk.bat}"
Next steps
The embulk example command generates an example CSV file.
You can try Embulk quickly by running:
embulk example ./try1
embulk guess ./try1/seed.yml -o config.yml
embulk preview config.yml
embulk run config.yml
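The four commands above depend on each other: guess reads the seed.yml that example writes, and preview and run read the config.yml that guess writes. A minimal sketch of chaining them so that a failure stops the sequence is shown below. The function name try_embulk and the EMBULK variable are hypothetical conveniences, not part of Embulk.

```shell
# Hypothetical wrapper (not part of Embulk): run the quick-start steps
# in order and stop at the first command that fails.
# EMBULK can be overridden to point at a specific embulk executable.
EMBULK="${EMBULK:-embulk}"

try_embulk() {
  dir="$1"
  "$EMBULK" example "$dir" &&
  "$EMBULK" guess "$dir/seed.yml" -o config.yml &&
  "$EMBULK" preview config.yml &&
  "$EMBULK" run config.yml
}
```

After sourcing this, `try_embulk ./try1` runs the same sequence as the commands above and returns a non-zero status if any step fails.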