SFTP Bulk
This page contains the setup guide and reference information for the SFTP Bulk source connector.
This connector provides the following features not found in the standard SFTP source connector:
- Bulk ingestion of files: This connector can consolidate and process multiple files as a single data stream in your destination system.
- Incremental loading: This connector supports incremental loading, allowing you to sync files from the SFTP server to your destination based on their creation or last modification time.
- Load most recent file: You can choose to load only the most recent file from the designated folder path. This feature is particularly useful when dealing with snapshot files that are regularly added and contain the latest data.
Prerequisites
- Access to a remote server that supports SFTP
- Host address
- Valid username and password associated with the host server
Setup guide
Step 1: Set up SFTP authentication
To set up the SFTP connector, you will need to select at least one of the following authentication methods:
- Your username and password credentials associated with the server.
- A private/public key pair.
To set up key pair authentication, you may use the following steps as a guide:
-
Open your terminal or command prompt and use the
ssh-keygen
command to generate a new key pair.noteIf your operating system does not support the
ssh-keygen
command, you can use a third-party tool like PuTTYgen to generate the key pair instead. -
You will be prompted for a location to save the keys, and a passphrase to secure the private key. You can press enter to accept the default location and opt out of a passphrase if desired. Your two keys will be generated in the designated location as two separate files. The private key will usually be saved as
id_rsa
, while the public key will be saved with the.pub
extension (id_rsa.pub
). -
Use the
ssh-copy-id
command in your terminal to copy the public key to the server.
ssh-copy-id <username>@<server_ip_address>
Be sure to replace your specific values for your username and the server's IP address.
Depending on factors such as your operating system and the specific SSH implementation your remote server uses, you may not be able to use the ssh-copy-id
command. If so, please consult your server administrator for the appropriate steps to copy the public key to the server.
- You should now be able to connect to the server via the private key. You can test this by using the
ssh
command:
ssh <username>@<server_ip_address>
For more information on SSH key pair authentication, please refer to the official documentation.
Step 2: Set up the SFTP connector in Airbyte
- Log in to your Airbyte Cloud account, or navigate to your Airbyte Open Source dashboard.
- In the left navigation bar, click Sources. In the top-right corner, click + New source.
- Find and select SFTP from the list of available sources. For Airbyte Cloud users: If you do not see the SFTP Bulk source listed, please make sure the Alpha checkbox at the top of the page is checked.
- Enter a Source name of your choosing.
- Enter your Username, as well as the Host Address and Port. The default port for SFTP is 22. If your remote server is using a different port, please enter it here.
- Enter your authentication credentials for the SFTP server (Password or Private Key). If you are authenticating with a private key, you can upload the file containing the private key (usually named
rsa_id
) using the Upload file button. - Enter a Stream Name. This will be the name of the stream that will be outputted to your destination.
- Use the dropdown menu to select the File Type you wish to sync. Currently, only CSV and JSON formats are supported.
- Provide a Start Date using the provided datepicker, or by programmatically entering the date in the format
YYYY-MM-DDT00:00:00Z
. Incremental syncs will only sync files modified/added after this date. - If you wish to configure additional optional settings, please refer to the next section. Otherwise, click Set up source and wait for the tests to complete.