[QDP] PyTorch CUDA stream‑aware encode for GPU tensors #930

400Ping · 2026-01-23T15:16:55Z

Purpose of PR

lib.rs: obtain PyTorch’s current CUDA stream pointer and route CUDA-tensor encoding through stream-aware core APIs to prevent cross-stream races.
lib.rs: add encode_from_gpu_ptr_with_stream and encode_batch_from_gpu_ptr_with_stream, and synchronize on the provided stream; keep default-stream entry points intact.
amplitude.rs: add a stream-aware GPU norm path so normalization and kernels run on the same stream before host reads.
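
As a rough illustration of the first point, the sketch below reads the raw handle of PyTorch's current CUDA stream from Python. `torch.cuda.current_stream` and its `cuda_stream` attribute are existing PyTorch APIs; the helper name `current_stream_ptr` is hypothetical and is not taken from this PR. The call into `encode_from_gpu_ptr_with_stream` is left out on purpose, since its signature is not given here.

```python
def current_stream_ptr(tensor=None):
    """Return the raw handle of PyTorch's current CUDA stream as an int.

    Hypothetical helper for illustration only. Returns 0 (the default
    stream) when PyTorch or CUDA is unavailable, or when the given
    tensor does not live on a CUDA device.
    """
    try:
        import torch
    except ImportError:
        return 0
    if not torch.cuda.is_available():
        return 0
    if tensor is not None and not tensor.is_cuda:
        return 0
    # Query the stream of the tensor's own device so that multi-GPU
    # callers do not accidentally pick up another device's stream.
    device = tensor.device if tensor is not None else None
    return int(torch.cuda.current_stream(device).cuda_stream)


if __name__ == "__main__":
    # Prints 0x0 on the default stream or on a machine without CUDA.
    print(hex(current_stream_ptr()))
```

Passing a handle like this to a stream-aware entry point lets the encode work be queued on the same stream that produced the tensor, which is the ordering guarantee the description relies on to avoid cross-stream races.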

Related Issues or PRs

Related to #726

Changes Made

Breaking Changes

Yes
No

Checklist

Added or updated unit tests for all changes
Added or updated documentation for all changes
Successfully built and ran all unit tests or manual tests locally
PR title follows "MAHOUT-XXX: Brief Description" format (if related to an issue)
Code follows ASF guidelines

Signed-off-by: 400Ping <fourhundredping@gmail.com>

rich7420 · 2026-01-24T16:06:15Z

@400Ping thanks for the patch!
Please fix the CI errors and merge conflicts.

400Ping · 2026-01-25T10:56:16Z

@CheyuWu PTAL

Signed-off-by: 400Ping <fourhundredping@gmail.com>

rich7420

@400Ping thanks for the patch!
overall LGTM
But I think we could add more unit tests for the new feature here.

qdp/qdp-python/src/lib.rs

Signed-off-by: 400Ping <jiekaichang@apache.org>

rich7420 · 2026-01-29T06:24:16Z

I think this PR should add tests for the feature, as I mentioned previously.

viiccwen

overall LGTM!
left comments.

qdp/qdp-core/src/gpu/encodings/amplitude.rs

Signed-off-by: 400Ping <jiekaichang@apache.org>

GPU pointer validation stream sync

b3c8829

Signed-off-by: 400Ping <fourhundredping@gmail.com>

400Ping changed the title ~~[QDP] GPU pointer validation stream sync~~ [QDP] GPU Pointer Validation Stream Sync Jan 23, 2026

400Ping closed this Jan 23, 2026

400Ping reopened this Jan 23, 2026

400Ping changed the title ~~[QDP] GPU Pointer Validation Stream Sync~~ [QDP] PyTorch CUDA stream‑aware encode for GPU tensors Jan 23, 2026

400Ping added 3 commits January 25, 2026 19:04

fix

f8d6307

Signed-off-by: 400Ping <fourhundredping@gmail.com>

Merge branch 'main' into qdp/pytorch-direct-gpu-stream-sync

4a762f9

fix ci error

f0e0783

Signed-off-by: 400Ping <fourhundredping@gmail.com>

400Ping requested review from guan404ming, rich7420 and ryankert01 January 26, 2026 16:11

rich7420 reviewed Jan 27, 2026

qdp/qdp-python/src/lib.rs (resolved)

CheyuWu approved these changes Jan 27, 2026

400Ping force-pushed the main branch from 0efbb29 to 6bf1c74 January 27, 2026 11:49

fix

4b71ad0

Signed-off-by: 400Ping <jiekaichang@apache.org>

400Ping requested a review from rich7420 January 28, 2026 16:05

guan404ming added this to the Qumat 0.5.1 milestone Jan 29, 2026

viiccwen reviewed Jan 29, 2026

qdp/qdp-core/src/gpu/encodings/amplitude.rs (outdated, resolved)

cursor bot force-pushed the qdp/pytorch-direct-gpu-stream-sync branch from cd52809 to 4b71ad0 January 29, 2026 10:05

400Ping force-pushed the qdp/pytorch-direct-gpu-stream-sync branch from cd52809 to 4b71ad0 January 29, 2026 11:41

400Ping added 5 commits January 29, 2026 19:47

fix conflicts

d9541ed

Signed-off-by: 400Ping <jiekaichang@apache.org>

update

7e49c2b

Signed-off-by: 400Ping <jiekaichang@apache.org>

fix conflicts

398c74c

Signed-off-by: 400Ping <jiekaichang@apache.org>

Merge branch 'main' into qdp/pytorch-direct-gpu-stream-sync

2b39850

fix pre-commit

211cb52

Signed-off-by: 400Ping <jiekaichang@apache.org>
