Skip to main content

User Guide

Software Installation

For installation instructions, see "Quick Start".

Login

Enter the configured administrator username and password, then click the "Login" button to sign in. You can switch the interface language from the dropdown at the top-right of the login page. After login, you will land on the Dashboard by default.

cinfer-login

Dashboard

The Dashboard displays the environment information, resource usage, and model-related stats for the current service. It primarily includes:

  • Device Info: Basic information about the deployment device or server, including OS version, software name, inference resources (CPU/GPU, etc.), version number, as well as counts of current models and assigned models.
  • CPU Usage: CPU utilization of the service, showing the past 24 hours.
  • GPU Usage: GPU utilization of the service, showing the past 24 hours.
  • Memory Usage: Memory usage of the service, showing the past 24 hours.
  • Model Operating Status: Counts of published and unpublished models in Model Management.

cinfer-login

Model Management

In Model Management, you can upload, publish, and unpublish various AI models, and configure input/output parameters. Published models expose a unified OpenAPI for external applications to perform standardized AI inference calls.

model-management

Add Model

Cinfer allows users to upload their own trained ONNX models and manage them. Click the "Add" button on the page to open the form and upload a model. Parameter descriptions:

  • Name: Model name
  • Infer Engine Type: Model type; currently only ONNX is supported
  • Model File: Model source file; it must match the selected model type
  • Model Parameter: Model parameters used to define model inputs/outputs; can be customized for application integration and requires adaptation in the inference layer. For details on filling Model Parameter, see "Dev Guide".
  • Remark: Notes model-management model-management

Manage Models

model-management

Edit Model

While a model is unpublished, click the "Edit" button to reopen the form and modify model information. model-management

Delete Model

While a model is unpublished, click the "Delete" button and then "Delete" in the confirmation dialog to remove the model. model-management

Publish Model

After completing the add-model form and clicking "Save", a unique Model ID is generated and appears in the list. You can then edit, publish, or delete it. Click the middle button to publish the model; in the confirmation dialog, click "Confirm" to finalize publishing. After publishing, you can authorize clients to use this model in Token Management. If a token is configured with permissions for all models, the published model will be granted automatically. model-management

Unpublish Model

After publishing, you can unpublish a model. Note: This operation can impact existing applications or clients using the model service, potentially causing failures. Proceed with caution. model-management

Token Management

This module defines clients allowed to access the Cinfer service by assigning tokens. Token Management provides token creation and management, request IP whitelists, and API request limits. token-management

Add Token

Click the "Add" button in the menu to create a token. Form fields include:

  • Name: Token name; customizable
  • Model Permissions: The models this token may call. Currently supports all models or selecting specific published models. If "all models" is selected, newly published models will automatically be granted to tokens with this option.
  • Request Frequency Limit: Per-minute API request limit for anti-abuse. Requests beyond the limit are blocked. Applies to the total of all API requests.
  • Request Quantity Limit: Monthly API request limit to control usage. Requests beyond the limit are blocked. Applies to the total of all API requests.
  • IP Whitelist: IP allowlist; supports multiple entries separated by "," or ";". Only listed IPs may access services with the token.
  • Remark: Notes After completing the form, click "Save" to create the token. token-management After saving, a dialog shows the token. Click "Copy" to copy it. Keep it secure. You can later use this token to call AI services within its permission scope. token-management

Manage Tokens

token-management Tokens support delete, edit, and disable operations with the following controls:

  • Disable: Toggle the status switch in the list to enable/disable a token. Disabled tokens are denied access. Operate with caution.
  • Edit: Click the "Edit" button to modify model permissions, request limits, and the IP allowlist. This does not affect currently integrated clients.
  • Delete: Click the "Delete" button to remove a token. This operation is highly risky and will affect clients using the token. Avoid unless necessary. If confirmed, click "Delete" in the dialog. token-management