Create an Apache HDFS source connection in the UI

NOTE
The Apache HDFS connector is in beta. See the Sources overview for more information on using beta-labeled connectors.

Source connectors in Adobe Experience Platform provide the ability to ingest externally sourced data on a scheduled basis. This tutorial provides steps for authenticating an Apache Hadoop Distributed File System (hereinafter referred to as “HDFS”) source connector using the Platform user interface.

Getting started

This tutorial requires a working understanding of the following components of Platform:

If you already have a valid HDFS connection, you may skip the remainder of this document and proceed to the tutorial on configuring a dataflow.

Gather required credentials

In order to authenticate your HDFS source connector, you must provide values for the following connection property:

Credential
Description
url
The URL defines auth params required for connecting to HDFS anonymously. For more information on how to obtain this value, refer to the following document on HTTPS authentication for HDFS.

Connect your HDFS account

Once you have gathered your required credentials, you can follow the steps below to link your HDFS account to Platform.

Log in to Adobe Experience Platform and then select Sources from the left navigation bar to access the Sources workspace. The Catalog screen displays a variety of sources for which you can create an account with.

You can select the appropriate category from the catalog on the left-hand side of your screen. Alternatively, you can find the specific source you wish to work with using the search option.

Under the Cloud storage category, select Apache HDFS. If this is your first time using this connector, select Configure. Otherwise, select Add data to create a new HDFS connector.

catalog

The Connect to HDFS page appears. On this page, you can either use new credentials or existing credentials.

New account

If you are using new credentials, select New account. On the input form that appears, provide a name, an optional description, and your HDFS credentials. When finished, select Connect to source and then allow some time for the new connection to establish.

connect

Existing account

To connect an existing account, select the HDFS account you want to connect with, then select Next to proceed.

existing

Next steps

By following this tutorial, you have established a connection to your HDFS account. You can now continue on to the next tutorial and configure a dataflow to bring data from your cloud storage into Platform.

recommendation-more-help
337b99bb-92fb-42ae-b6b7-c7042161d089