Experimental Measurement of Z-Image: An Efficient Image Generation Model with 6B Parameters

Z-Image is an efficient 6B-parameter image generation model that matches or even surpasses mainstream competitive models with only 8 inference steps (8 NFEs), and runs smoothly on consumer-grade devices with 16 GB of VRAM. The model has three variants: Turbo (lightweight and real-time, suited to AIGC applications and mini-programs), Base (undistilled, for secondary fine-tuning), and Edit (specialized for image editing), with Turbo being the most valuable for practical deployment. In our tests, generating a 1024×1024 image took 0.8 seconds (with Flash Attention and model compilation), with peak memory usage of 14 GB. Technically, the S3-DiT architecture improves parameter efficiency, the Decoupled-DMD distillation algorithm enables 8-step inference, and DMDR fuses reinforcement learning with DMD to optimize quality. Its strengths are bilingual text rendering, photorealistic generation, low-VRAM deployment, and image editing. Its limitations are that only the Turbo variant is open-sourced so far, and that heavily stylized generation and model-compilation time still need optimization. Overall, Z-Image balances performance, efficiency, and practicality, lowering the deployment barrier for small and medium-sized teams and individual developers.
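The 0.8-second figure above is a wall-clock measurement taken after warm-up, since the first runs absorb model-compilation time. A minimal sketch of such a timing harness (the `benchmark` helper and its parameters are our own illustration, not part of Z-Image's tooling):

```python
import time
import statistics

def benchmark(fn, *, warmup=2, runs=5):
    """Time a zero-argument callable: discard warm-up runs
    (which absorb compilation and cache effects), then return
    the median of the timed runs in seconds."""
    for _ in range(warmup):
        fn()
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)
```

When timing a CUDA pipeline, call `torch.cuda.synchronize()` before reading the clock, and read peak VRAM afterwards with `torch.cuda.max_memory_allocated()`.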

Read More
Deploying the Baidu ERNIE 4.5 Open-Source Model and Calling It from Android

In the previous article, "Usage and Deployment of the ERNIE 4.5 Open-Source Large Model", we introduced how to deploy the ERNIE 4.5 open-source large model with FastDeploy and briefly called its interface. This article describes how an Android app can call that deployed interface to implement conversations.

Read More
Quickly Deploy a DeepSeek-R1 Service from Scratch

This post walks through the simplest possible commands for deploying a DeepSeek-R1 service. Anaconda is assumed to be installed already, and the vLLM framework is used, which makes deployment straightforward even from within China.
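Once the server is running, vLLM exposes an OpenAI-compatible API (by default on port 8000), so a client only needs to POST a JSON body to `/v1/chat/completions`. A minimal sketch of building that request body; the model name and temperature here are illustrative assumptions, not values from the article:

```python
import json

def build_chat_payload(prompt, model="deepseek-ai/DeepSeek-R1-Distill-Qwen-7B"):
    """Build the JSON body for an OpenAI-compatible
    /v1/chat/completions request (model name is an assumption)."""
    return json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.6,
    })
```

POST this body with `Content-Type: application/json` to `http://localhost:8000/v1/chat/completions`; the reply text is found at `choices[0].message.content` in the response.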

Read More
Text Endpoint Detection Based on Large Language Models

This article introduces a method for detecting text endpoints with large language models (LLMs) to improve voice activity detection (VAD) in voice conversations. By fine-tuning a model to predict whether a sentence is complete, the user's intent can be judged more accurately. The steps are: 1. **Principle and data preparation**: leverage the text-generation capability of LLMs, fine-tuning on a predefined dataset in a specific format. 2. **Fine-tuning the model**: train with the LLaMA-Factory tool, choosing a suitable prompt template and an optimized data format. 3.
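The "specific format" in step 1 typically means instruction-tuning records. A minimal sketch of how sentence-completeness samples might be built in an Alpaca-style format (the instruction wording and label strings are our own illustration, not the article's actual dataset):

```python
import json

def make_sample(text, is_complete):
    """Build one Alpaca-style instruction-tuning record: the model
    is trained to answer 'complete' or 'incomplete' for the input."""
    return {
        "instruction": ("Decide whether the user's utterance is a complete "
                        "sentence. Answer 'complete' or 'incomplete'."),
        "input": text,
        "output": "complete" if is_complete else "incomplete",
    }

samples = [
    make_sample("What's the weather like in Beijing today?", True),
    make_sample("Could you tell me how to", False),
]
print(json.dumps(samples, ensure_ascii=False, indent=2))
```

A dataset of such records can then be registered with LLaMA-Factory and used for supervised fine-tuning.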

Read More