Tutorial

AI Fusion Kit Quick Start Guide

AVerMedia AI Fusion Kit is an all-in-one solution for LLM/VLM developers. It consists of a powerful AI box PC, a 4K camera, and an AI speakerphone, allowing you to easily build your own multimodal AI applications. This guide will walk you through the steps to get started with the AI Fusion Kit.

Time to First Token

Time to First Token (TTFT) is the latency between the moment a user presses the Enter key and the moment the first character appears on the screen. Excessive TTFT can greatly diminish the overall user experience.

TTFT is a crucial response-time indicator for an online interactive application powered by a large language model (LLM), as it reflects how quickly users see the first character from the model on a web page.

Here, we will explore two simple ways to measure the first-token latency of a language model.
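As a rough illustration of the definition above, TTFT can be measured by starting a timer when the request is issued and stopping it when the first non-empty token arrives from a streaming response. The sketch below is only an assumption-laden example and is not one of the two methods covered in the tutorial: `measure_ttft` and `fake_stream` are hypothetical names, and `fake_stream` stands in for a real streaming LLM client.

```python
import time
from typing import Iterable, Iterator, Tuple


def measure_ttft(stream: Iterable[str]) -> Tuple[float, str]:
    """Return (seconds until the first non-empty token, that token).

    The stream is assumed to be lazy, so the work of producing the
    first token happens inside the timed loop.
    """
    start = time.perf_counter()
    for token in stream:
        if token:  # skip empty keep-alive chunks
            return time.perf_counter() - start, token
    raise ValueError("stream ended without producing a token")


def fake_stream(delay_s: float = 0.05) -> Iterator[str]:
    """Hypothetical stand-in for a streaming LLM response."""
    time.sleep(delay_s)  # simulated prompt processing before the first token
    yield "Hello"
    yield ", world"


if __name__ == "__main__":
    ttft, first = measure_ttft(fake_stream())
    print(f"first token {first!r} after {ttft * 1000:.1f} ms")
```

With a real client, the timer should start immediately before the request is sent, so that network and prompt-processing time are both included in the measurement.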

How to Set Up the QL601 Development Environment

The QL601 is a powerful single-board computer built on the Qualcomm® QCS6490 chipset. Together with the AVerMedia software stack, it helps developers build AI-powered multimedia applications.

In this tutorial, we will guide you through setting up the QL601 development environment so that you can get started quickly.

How to Download Qualcomm AI Hub Models

Qualcomm AI Hub provides various AI models optimized for Qualcomm devices. This guide introduces two methods for downloading these models:

  • Through the Qualcomm AI Hub website.
  • Using the Python package qai-hub-models.

You'll learn how to access all the Qualcomm-provided models, including those with licensing restrictions like YOLOv8 and YOLOv11.