Tag: Stepfun

China's AI Rise, Geopolitics, and US Strategy

Explore StepFun's AI advancements, China's geopolitical strategy, US efforts to maintain tech leadership, and US-China relations through insights.

China's AI Rise, Geopolitics, and US Strategy

Step1X-Edit: Open-Source Image Editing Model

Step1X-Edit, developed by StepFun, is an open-source image editing model achieving SOTA performance. It excels in semantic analysis, identity preservation, and region control, supporting 11 image editing tasks, rivaling GPT-4o and Gemini 2.0 Flash.

Step1X-Edit: Open-Source Image Editing Model

Step New Attention Mechanism KV Cache Reduced

This article explores Multi-matrix Factorization Attention (MFA) and MFA-Key-Reuse (MFA-KR), novel attention mechanisms that significantly reduce KV cache usage in large language models (LLMs). MFA and MFA-KR achieve performance comparable to or exceeding traditional MHA and MLA while substantially lowering memory consumption. Key innovations include increasing attention head dimensions, employing low-rank decomposition, and using a single key-value head. Experimental results demonstrate significant memory savings and scalability, making MFA a promising solution for efficient LLM inference.

Step New Attention Mechanism KV Cache Reduced