V-HelpPremium IT service for your business
← All news
Artificial intelligence

Pinterest Cuts AI Costs by 90% Through Model Customization

Pinterest Cuts AI Costs by 90% Through Model Customization

Photo: VentureBeat

Quick answer

Pinterest сократил расходы на ИИ на 90%, заменив стандартный vision-слой модели Qwen3-VL собственными эмбеддингами. Это улучшило качество рекомендаций на 30% и ускорило инференс в 20 раз.

Pinterest, serving over 620 million monthly users, faced high costs associated with using advanced AI models for generating recommendations. The solution was a radical redesign of the Qwen3-VL model’s architecture: engineers removed the standard vision layer and integrated custom embeddings trained on the platform’s unique data.

According to CTO Matt Madrigal, the key to success was working with open-source models that can be adapted for specific tasks. Pinterest’s team has long used such solutions—from BERT and CLIP to Qwen3-VL—but now focuses on deep customization. For example, the Navigator 1 shopping assistant model was rebuilt from scratch: instead of a standard vision encoder, custom multimodal embeddings were used, reducing inference latency by 20 times.

Another innovation was the "Taste Graph"—a dynamic system tracking user preferences in real time. Unlike social graphs, it focuses not on connections between users but on their interests, helping convert inspiration into actions like ad clicks or purchases. The model continuously updates based on new data, ensuring a personalized experience.

Madrigal emphasized that for mission-critical functions serving millions of users, the company either develops solutions in-house or maximally adapts open-source models. This approach not only reduces costs but also enhances service quality, which is especially important for a platform centered on visual search and discovery.

Common questions

Как Pinterest удалось сократить расходы на ИИ на 90%?
Компания заменила стандартный vision-слой модели Qwen3-VL собственными мультимодальными эмбеддингами, обученными на уникальных данных платформы.
Какие преимущества дала кастомизация модели Qwen3-VL?
Снижение затрат на 90%, улучшение качества рекомендаций на 30% и ускорение инференса в 20 раз за счёт использования собственных эмбеддингов.
Что такое «граф вкусов» в Pinterest?
Динамическая система, отслеживающая предпочтения пользователей в реальном времени для персонализации рекомендаций и повышения конверсии.
Почему Pinterest использует open-source модели для критически важных функций?
Это позволяет максимально адаптировать решения под специфические задачи платформы, снижая издержки и повышая качество сервиса.
Share:

Dzen feed: /feed/dzen.xml · RSS: /feed.xml

Why trust this

Prepared by the V-Help editorial team from the primary source with a published date.

Published by: V-Help.ru news desk

Source: VentureBeat