- π₯ Multimodal Video Captioning - Audio-Visual understanding
- ποΈ Computer Vision - 3D Reconstruction, Pose Estimation
- π€ Vision Transformers - Attention mechanisms for visual tasks
- π Deep Learning Research - PyTorch implementations
- π Portfolio | π§ ashokbk215@gmail.com
-
00:28
(UTC +05:45) - github.com/blazewild
- https://www.asokbk.com.np/
- in/asokbk
Highlights
- Pro
Pinned Loading
-
Real-Time-Motion-Transfer-to-a-3D-Avatar
Real-Time-Motion-Transfer-to-a-3D-Avatar PublicReal-time human pose detection and motion transfer to 3D avatars using MediaPipe, DNN, and Three.js β supports webcam and video inputs with custom avatar integration.
-
Custom_LLM_DataGen_Template
Custom_LLM_DataGen_Template Publicπ§ Modular pipeline for generating high-quality, domain-specific datasets for LLM fine-tuning β from PDFs and web scraping to synthetic Q&A generation, quality filtering, and training-ready formatting.
-
TrekNepal-3B__Finetuned-Llama3.2-3B
TrekNepal-3B__Finetuned-Llama3.2-3B PublicFine-tuning pipeline for LLaMA 3.2-3B on Nepal trekking using custom synthetic Q&A data, LLM-based filtering, and QLoRA optimization.
Python
-
Hav-Cocap
Hav-Cocap PublicHav-Cocap: Hybrid Audio-Visual Compressed Video Captioning framework. Extends CoCap with an Audio Encoder and evaluated on the AVCaps dataset.
Jupyter Notebook
If the problem persists, check the GitHub status page or contact support.


