Nhaaa4/Data-Engineering-Tools

Data Engineering Tools Installation Guides

This repository collects installation guides for the tools used throughout the Data Engineering course. Each tool has its own Markdown file with setup steps, configuration notes, and troubleshooting tips.

Included Installation Guides

You will find step-by-step setup instructions for tools such as:

  1. Hadoop (HDFS & YARN)
  2. Apache Hive
  3. Apache Spark
  4. Apache Airflow
  5. Apache Kafka

Each guide is located in its own .md file for easy navigation and reference.

Purpose of This Repository

This repository is designed to:

  • Provide a standardized installation reference for students
  • Reduce setup issues during hands-on labs
  • Ensure consistency across different environments
  • Serve as a quick troubleshooting resource

How to Use

  • Browse the repository to find the installation guide for the tool you need
  • Follow the steps in your selected tool’s .md guide
  • Use the troubleshooting notes at the bottom of each guide if things break
  • Continue to the next guide as required for your module
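
After following a guide, a quick check that the tool's command is on your PATH can catch common setup issues early. The following is a minimal sketch and not part of the guides themselves: the command names (`hadoop`, `hive`, `spark-submit`, `airflow`, `kafka-topics.sh`) are assumptions about a typical installation, and your setup may expose them under different names or only inside each tool's `bin` directory.

```python
import shutil

# Assumed command names for each tool; adjust to match your installation.
TOOL_COMMANDS = {
    "Hadoop": "hadoop",
    "Apache Hive": "hive",
    "Apache Spark": "spark-submit",
    "Airflow": "airflow",
    "Kafka": "kafka-topics.sh",
}


def check_tools(commands=TOOL_COMMANDS, which=shutil.which):
    """Map each tool name to the resolved command path, or None if not found."""
    return {tool: which(cmd) for tool, cmd in commands.items()}


if __name__ == "__main__":
    for tool, path in check_tools().items():
        status = path if path else "not found on PATH"
        print(f"{tool}: {status}")
```

A tool reported as not found is not necessarily missing; it may simply need its `bin` directory added to PATH, which the troubleshooting notes in the relevant guide may cover.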

Contribution

If you find a problem or want to improve a guide, open an issue or submit a pull request.
