HueClientRest

A Python REST client for interacting with Hadoop Hue's REST API. This client allows you to execute SQL queries, manage files, and download results through Hue's web interface programmatically. With support of Quadient

Features

Authentication: JWT token-based authentication
Query Execution: Execute SQL queries with various dialects (Hive, Spark SQL, etc.)
Result Management: Fetch query results with pagination support
File Operations: List, download, and manage files in remote storage
CSV Export: Save query results directly to CSV files
Batch Operations: Execute queries and download resulting files in one operation

Installation

pip install hueclientrest

Quick Start

from hueclientrest import HueClientREST

# Initialize the client
client = HueClientREST(
    host="https://your-hue-server.com",
    username="your_username",
    password="your_password",
    verify_ssl=True
)

# Execute a query and save results to CSV
client.run(
    statement="SELECT * FROM your_table LIMIT 100",
    dialect="hive",
    filename="results.csv"
)

Usage Examples

Basic Query Execution

from hueclientrest import HueClientREST

client = HueClientREST(
    host="https://your-hue-server.com",
    username="your_username",
    password="your_password"
)

# Simple query execution with CSV output
client.run(
    statement="SELECT count(*) FROM sales WHERE date >= '2023-01-01'",
    dialect="hive",
    filename="sales_count.csv"
)

Advanced Query Execution

# Step-by-step execution for more control
client.login()

# Execute query
operation_id = client.execute(
    statement="SELECT * FROM large_table WHERE condition = 'value'",
    dialect="spark"
)

# Wait for completion
client.wait(operation_id, poll_interval=5, timeout=600)

# Fetch results with custom batch size
headers, rows = client.fetch_all(operation_id, batch_size=5000)

# Save to CSV
client.save_csv(headers, rows, "large_results.csv")

File Operations

# List files in a directory
files = client.list_directory("/user/data/exports")
for file_info in files:
    print(f"Name: {file_info['name']}, Size: {file_info.get('size', 'N/A')}")

# Download a specific file
client.download_file(
    file_path="/user/data/exports/report.csv",
    local_filename="./downloads/report.csv"
)

# Download all files from a directory
downloaded_files = client.download_directory_files(
    directory_path="/user/data/exports",
    local_dir="./downloads",
    file_pattern="part-"  # Only download files containing "part-"
)

# Upload file
response = client.upload('/user/uploads', '.import.csv')

Export Query Results to Files

# Execute INSERT OVERWRITE DIRECTORY and download results
statement = """
INSERT OVERWRITE DIRECTORY '/user/exports/sales_2023'
STORED AS TEXTFILE
SELECT * FROM sales WHERE year = 2023
"""

downloaded_files = client.run_and_download(
    statement=statement,
    directory_path="/user/exports/sales_2023",
    local_dir="./sales_data",
    dialect="hive",
    file_pattern="part-",
    timeout=900  # 15 minutes timeout
)

print(f"Downloaded {len(downloaded_files)} files")

Working with Different SQL Dialects

# Hive query
client.run(
    statement="SHOW TABLES",
    dialect="hive",
    filename="hive_tables.csv"
)

# Spark SQL query
client.run(
    statement="SELECT spark_version()",
    dialect="sparksql",
    filename="spark_version.csv"
)

# Impala query
client.run(
    statement="SELECT version()",
    dialect="impala",
    filename="impala_version.csv"
)

SSL Configuration

# Disable SSL verification (not recommended for production)
client = HueClientREST(
    host="https://your-hue-server.com",
    username="your_username",
    password="your_password",
    verify_ssl=False,
    ssl_warnings=False  # Suppress SSL warnings
)

# Custom SSL verification
client = HueClientREST(
    host="https://your-hue-server.com",
    username="your_username",
    password="your_password",
    verify_ssl=True  # Use system CA bundle
)

Error Handling

from hueclientrest import HueClientREST

client = HueClientREST(
    host="https://your-hue-server.com",
    username="your_username",
    password="your_password"
)

try:
    client.run(
        statement="SELECT * FROM non_existent_table",
        dialect="hive",
        filename="results.csv"
    )
except RuntimeError as e:
    print(f"Authentication or execution error: {e}")
except TimeoutError as e:
    print(f"Query timed out: {e}")
except ValueError as e:
    print(f"Invalid response format: {e}")
except Exception as e:
    print(f"Unexpected error: {e}")

Batch Processing

queries = [
    ("SELECT count(*) FROM table1", "table1_count.csv"),
    ("SELECT count(*) FROM table2", "table2_count.csv"),
    ("SELECT count(*) FROM table3", "table3_count.csv"),
]

client = HueClientREST(
    host="https://your-hue-server.com",
    username="your_username",
    password="your_password"
)

# Login once for all queries
client.login()

for query, filename in queries:
    try:
        operation_id = client.execute(query, "hive")
        client.wait(operation_id)
        headers, rows = client.fetch_all(operation_id)
        client.save_csv(headers, rows, filename)
        print(f"Completed: {filename}")
    except Exception as e:
        print(f"Failed {filename}: {e}")

Unit tests

python -m unittest

API Reference

`HueClientREST`

Main client class for interacting with Hue REST API.

Constructor Parameters

host (str): Hue server URL
username (str): Username for authentication
password (str): Password for authentication
verify_ssl (bool): Whether to verify SSL certificates (default: True)
ssl_warnings (bool): Whether to show SSL warnings (default: False)

Methods

login(): Authenticate and obtain JWT token
execute(statement, dialect): Execute SQL statement
wait(operation_id, poll_interval, timeout): Wait for operation completion
fetch_all(operation_id, batch_size): Fetch all query results
save_csv(headers, rows, filename): Save results to CSV file
run(statement, dialect, filename, batch_size): Execute query and save to CSV
list_directory(directory_path, pagesize): List directory contents
download_file(file_path, local_filename): Download single file
download_directory_files(directory_path, local_dir, file_pattern): Download multiple files
run_and_download(statement, directory_path, local_dir, ...): Execute and download results
check_directory_exists(directory_path): Check if directory exists
upload_file(dest_path, file_path): Upload a file to a directory

Requirements

Python 3.7+
requests
urllib3

License

This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for details.

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

Support

For issues and questions, please create an issue in the GitHub repository.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
src/hueclientrest		src/hueclientrest
test		test
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HueClientRest

Features

Installation

Quick Start

Usage Examples

Basic Query Execution

Advanced Query Execution

File Operations

Export Query Results to Files

Working with Different SQL Dialects

SSL Configuration

Error Handling

Batch Processing

Unit tests

API Reference

`HueClientREST`

Constructor Parameters

Methods

Requirements

License

Contributing

Support

About

Uh oh!

Releases 1

Packages

Contributors 2

Uh oh!

Languages

License

SpanishST/HueClientRest

Folders and files

Latest commit

History

Repository files navigation

HueClientRest

Features

Installation

Quick Start

Usage Examples

Basic Query Execution

Advanced Query Execution

File Operations

Export Query Results to Files

Working with Different SQL Dialects

SSL Configuration

Error Handling

Batch Processing

Unit tests

API Reference

HueClientREST

Constructor Parameters

Methods

Requirements

License

Contributing

Support

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Uh oh!

Languages

`HueClientREST`

Packages