Embedded AI for Edge Devices (TensorRT/ONNX)

Published: January 01, 2023

This project focuses on deploying computer vision models on edge devices under tight latency and compute constraints.

Embedded AI: Real-Time Instance Segmentation

Implemented and optimized two instance segmentation models: SparseInst and Yolov7.
Exported models and prepared deployment flows using ONNX.
Optimized for real-time execution on NVIDIA edge devices with CUDA TensorRT.

Practical emphasis

Converting research models into deployable artifacts
Latency/throughput tuning and performance validation
Deployment-oriented engineering (runtime, memory, and stability)

Share on

Bluesky Facebook LinkedIn Mastodon X (formerly Twitter)