Embedded AI for Edge Devices (TensorRT/ONNX)

Published:

This project focuses on deploying computer vision models on edge devices under tight latency and compute constraints.

Embedded AI: Real-Time Instance Segmentation

  • Implemented and optimized two instance segmentation models: SparseInst and Yolov7.
  • Exported models and prepared deployment flows using ONNX.
  • Optimized for real-time execution on NVIDIA edge devices with CUDA TensorRT.

Practical emphasis

  • Converting research models into deployable artifacts
  • Latency/throughput tuning and performance validation
  • Deployment-oriented engineering (runtime, memory, and stability)