Hello, I'm

Kamal Lamichhane

|

Staff AI Software Engineer @ Qualcomm | AI Inference Acceleration | Generative AI | System ML/AI | Embedded Systems | ISO-26262 | Deep Learning

Kamal Lamichhane

About Me

AI & Generative AI

Proficient in cutting-edge AI and Generative AI technologies, working on Qualcomm AI Runtime Engine and AI Accelerator HW.

Transformer models (GPT/LLaMA), Deep Learning Frameworks

Embedded Systems & Automotive

Experienced in Embedded Systems, Automotive Perception Stack, ADAS solutions, and safety-critical systems development.

ISO-26262 Certified | Real-Time Systems Expert

Education & Research

MS in Computer Engineering from University of Waterloo. Published researcher in embedded systems and autonomous driving safety.

7+ Publications | Teaching Experience

0
Publications

(Selected shown)

0
Years Experience
0
Projects Completed
0
Students Taught

Core Skills

AI & Machine Learning

Deep Learning (TensorFlow, Keras, PyTorch) 95%
Generative AI & Transformer Models 92%
CNN, LSTM, Neural Networks 93%
Computer Vision & CUDA 90%

Programming Languages

C/C++ (AUTOSAR C++) 97%
Python 93%
Assembly & Firmware 88%
MATLAB/Simulink 85%

Embedded & Automotive

RTOS (FreeRTOS, VxWorks) 94%
ADAS Solutions & Perception 92%
ARM/AVR/NXP Microcontrollers 90%
ROS/ROS2 88%

Safety & Standards

ISO 26262 (Functional Safety) 95%
AUTOSAR C++ Compliance 92%
FMEA & FTA Analysis 90%

Communication Protocols

CAN, I2C, SPI 92%
TCP/IP, Ethernet 90%
RS232, RS485 88%

Tools & Platforms

Git, CI/CD, DevOps 90%
Linux/Unix Shell Scripting 88%
Qt, Web Development 85%

Work Experience

Staff AI Software Engineer

Qualcomm · Dec 2024 – Present

Leading Qualcomm AI Runtime Engine and QAIRT features for accelerator hardware with AUTOSAR C++ compliant SDK.

View details →

Senior AI Software Engineer

Qualcomm · Aug 2022 – Nov 2024

Developed Qualcomm AI Engine Direct SDK and ML operators, enabling efficient deployment on Qualcomm AI accelerator hardware.

View details →

Adjunct Professor

Seneca College · May 2022 – Aug 2022

Taught programming courses in C++, Python, and ROS, bridging industry experience with applied engineering education.

View details →

Senior Software Developer, ADAS/AD Solutions

LeddarTech · Jun 2019 – Aug 2022

Built ADAS perception software and deep learning pipelines for object detection from point cloud data and camera sensors.

View details →

Embedded Software Engineer

Ciena · Feb 2018 – May 2019

Developed embedded middleware and debugging frameworks for Ciena 6500 packet-optical platforms on Linux.

View details →

Functional Safety / Software Safety Engineer

Ricardo (Peloton Technology) · May 2017 – Dec 2017

Executed ISO‑26262 safety activities including FMEA and FTA for autonomous truck platooning and automotive systems.

View details →

Graduate Teaching Assistant

University of Waterloo · Sep 2016 – Aug 2017

Supported courses in operating systems and embedded systems, including lab design and a small real-time executive (RTX).

View details →

Embedded Software Engineer

Magna Electronics · 2015 – 2017

Worked on parking slot detection and camera module safety for Magna Auto Parking systems using C/C++ and computer vision.

View details →

AI & ML Projects

AI Neural Network
Qualcomm AI

Qualcomm AI Runtime Engine (QAIRT)

Development of AI Runtime Engine for Qualcomm's AI Accelerator Hardware. Feature development, optimization, and AUTOSAR C++ compliance for QNN SDK.

Neural Networks AI Accelerator C++
Generative AI
Generative AI

Transformer Models & LLMs

Working with state-of-the-art transformer models including GPT and LLaMA architectures. Deep learning frameworks and ML systems optimization.

GPT LLaMA Transformers
Computer Vision
Computer Vision

ADAS & Object Detection

Deep learning for pedestrian, car, and object detection from point cloud data. Computer vision applications for autonomous driving systems.

CNN CUDA Point Cloud
ML Operators
ML Engineering

ML Operators & CI/CD

Machine learning operators development and ML testing infrastructure. Continuous integration and deployment pipelines for AI systems.

TensorFlow CI/CD Testing
AI Chip
Hardware AI

AI on Snapdragon SoCs

AI development on Snapdragon System-on-Chips. Optimizing neural networks for mobile and embedded AI accelerators.

Snapdragon SoC Optimization
Deep Learning
Deep Learning

CNN & LSTM Networks

Advanced deep learning with Convolutional and Long Short-Term Memory networks. TensorFlow, Keras, and PyTorch implementations.

CNN LSTM Keras

Education & Certifications

University of Waterloo
GPA: 3.9/4.0

University of Waterloo

Master of Science (MS), Computer Engineering

Specialized in Real-Time Embedded Systems and Software. Worked on specification-based bug detection for embedded software and machine learning projects. Published papers in embedded systems and safety analysis for autonomous driving.

Real-Time Systems Machine Learning Published Research
NMIT Bangalore
GPA: 9.73/10

Nitte Meenakshi Institute of Technology

Bachelor of Engineering (B.E.), Electrical & Electronics Engineering

Worked on command and data handling for Project STUDSAT-2. Developed a real-time operating system using FreeRTOS for satellite operations.

STUDSAT-2 Project RTOS Development Satellite Systems
Professional Certifications
Certified

Professional Certifications

ISO-26262 Functional Safety & Full Stack Development

ISO-26262 Automotive Functional Safety certified. Full Stack Web Developer Certification from Free Code Camp.

ISO-26262 Full Stack Dev Safety Analysis

Research Projects

Embedded Systems

Non-Intrusive Program Tracing

Advanced debugging techniques for embedded systems at deployment stage, enabling real-time monitoring without standard debugging tools.

2019 Debugging
Space Systems

RTOS for STUDSAT-2

Implemented Real-Time Operating System using FreeRTOS with 64 uniquely identified tasks for twin nano-satellite operations.

2015 IEEE Published
RF Engineering

Wideband Antenna Design

Designed, simulated and tested four wideband antennas for aircraft applications including angle of arrival estimation and adaptive nulling.

2014 RF Design

Selected Publications

01

Embedded RTOS Implementation for Twin Nano-Satellite STUDSAT-2

Kamal Lamichhane, Kiran M, Kannan T, Sandya S

IEEE International Conference on Metrology for Aerospace, 2015

02

Operational Flow for Twin Nano-Satellite Mission

Kamal Lamichhane, Kiran M, Kannan T, Sandya S

IEEE International Conference on Metrology for Aerospace, 2015

03

Actuation System Design and Payload Operation Flow for STUDSAT-2

Sandesh Hegde, Kannan T, Kamal Lamichhane, Sandya S

66th International Astronautical Congress, 2015

04

Performance Analysis Of Rectangular Patch Antenna Using Quarter Wave Feed Line

Dr. H.C.Nagaraj, Dr.T.S.Rukmini, Mr.Prasanna Paga, Kamal Lamichhane

International Conference on Academic Research in Engineering, Science And Technology, 2015

05

Implementation and Comparative Study of Algorithms to Avoid Obstacles in Mobile Robot Navigation

Min Raj Nepali, Amar Mani Aryal, Ashutosh, Kamal Lamichhane

International Journal of Computer Applications 97(11):13-18, July 2014

06

Early Breast Cancer Detection Using Statistical Parameters

H. C. Nagaraj, Prasanna Paga, Kamal Lamichhane

International Journal of Research in Engineering & Technology (IMPACT: IJRET), Vol. 2, Issue 3, Mar 2014

07

IP Based Distributed Smart Camera Surveillance System for Forest Application

Kamal Lamichhane, Shreeyak S. Sajjan, Huggi Pooja, Rajesh N

International Journal of Electronics Engineering, ISSN: 0973-7383, No.5 (2013) Issue No.: 2 (2013)

08

Non-intrusive program tracing of non-preemptive multitasking systems using power consumption

K. Lamichhane, C. Moreno, S. Fischmeister

2018 Design, Automation & Test in Europe Conference & Exhibition (DATE), Dresden, Germany, 2018, pp. 1147-1150, doi: 10.23919/DATE.2018.8342184

Blog & Articles

Edge AI
Technical
January 15, 2026 10 min read Edge AI

Edge AI Optimization: Bringing Intelligence to Resource-Constrained Devices

Learn about model compression techniques, efficient architectures, runtime optimization, and hardware acceleration strategies that enable sophisticated AI on smartphones, IoT devices, and embedded systems.

Quantization Pruning Mobile AI NPUs
Read Full Article
Transformers
Deep Dive
January 10, 2026 15 min read Deep Learning

Understanding Transformer Models: The Architecture That Changed AI

A comprehensive exploration of transformer architecture, from self-attention mechanisms to modern variants like GPT and BERT. Covers training techniques, efficiency improvements, and applications beyond NLP.

Attention BERT GPT Vision
Read Full Article

Testimonials & Recommendations

Department Chair

Seneca College

"As an adjunct professor, Kamal demonstrated exceptional teaching abilities and deep technical knowledge. His students consistently praised his clear explanations of complex concepts and his dedication to their success. He brings real-world industry experience into the classroom."

Graduate Student

University of Waterloo

"Kamal's guidance as a teaching assistant was invaluable. His expertise in embedded systems and real-time operating systems helped me understand complex concepts. He's patient, knowledgeable, and always willing to help students succeed."

Software Engineer

LeddarTech

"Working with Kamal on ADAS solutions was a great experience. His expertise in computer vision and deep learning, combined with his strong C++ skills, made him an invaluable team member. He consistently delivered high-quality, production-ready code."

Let's Connect