How to install Airflow on Ubuntu (Apache Airflow installation steps)

Apache Airflow is an open-source platform for orchestrating complex workflows and data pipelines. It allows users to schedule, monitor, and manage workflows through a programmable and extensible framework. Airflow is particularly well-suited for handling tasks related to data processing, ETL (Extract, Transform, Load), and other automation scenarios.

Installing Airflow on Ubuntu

To install Apache Airflow on Ubuntu, you can follow these general steps.

Install Dependencies: Ensure that you have Python and pip installed on your system. Additionally, you may need to install other dependencies:

sudo apt update
sudo apt install python3 python3-pip

Install Airflow: You can install Airflow using pip:

pip3 install apache-airflow

Initialize the Airflow Database: Airflow uses a database to store metadata. Initialize the database using the following command:

airflow db init

Start the Web Server: Start the Airflow web server to access the web interface:

airflow webserver --port 8080

Start the Scheduler: Open a new terminal and start the scheduler:

airflow scheduler

Access the Web Interface: Open a web browser and navigate to https://localhost:8080. You should see the Airflow web interface.

You’ve successfully installed Apache Airflow on your Ubuntu system. You can now create and manage your workflows using the Airflow web interface.

Related How-To guides: