Anyscale And MongoDB Collaborate To Enhance Multi-Modal Search

Terrill Dicki Jul 26, 2024 03:04

Anyscale and MongoDB join forces to revamp multi-modal search, offering scalable solutions and improved search relevance for e-commerce platforms.

Anyscale and MongoDB Collaborate to Enhance Multi-Modal Search

Anyscale, a leading AI application platform, has announced a collaboration with MongoDB to improve multi-modal search capabilities, according to Anyscale. This partnership aims to address the limitations of traditional search systems and provide a more sophisticated search experience for enterprises dealing with large volumes of multi-modal data.

Challenges with Legacy Search Systems

Enterprises often struggle with legacy search systems that are not equipped to handle the complexities of multi-modal data, which includes text, images, and structured data. Traditional systems typically rely on lexical search methods that match text tokens, resulting in poor recall and irrelevant search results.

For instance, an e-commerce platform searching for a “green dress” might return items like “Bio Green Apple Shampoo” due to the limitations of lexical search. This is because the search system only matches text tokens and does not understand the semantic meaning behind the query.

Innovative Solution Using Anyscale and MongoDB

The collaboration between Anyscale and MongoDB aims to overcome these limitations by leveraging advanced AI models and scalable data indexing pipelines. The solution involves:

  • Using Anyscale to run multi-modal large language models (LLMs) to generate product descriptions from images and names.

  • Generating embeddings for product names and descriptions, which are then indexed into MongoDB Atlas Vector Search.

  • Creating a hybrid search backend that combines legacy text matching with advanced semantic search capabilities.

This approach enhances the search relevance and user experience by understanding the semantic context of queries and returning more accurate results.

Use Case: E-commerce Platform

An example use case is an e-commerce platform with a large catalog of products. The platform aims to improve its search capabilities by implementing a scalable multi-modal search system that can handle both text and image data. The dataset used for this implementation is the Myntra dataset, which contains images and metadata of products for Myntra, an Indian fashion e-commerce company.

The legacy search system only matched text tokens, resulting in irrelevant search results. By using Anyscale and MongoDB, the platform can now return more relevant results by understanding the semantic meaning of queries and using images to enrich the search context.

System Architecture

The system is divided into two main stages: an offline data indexing stage and an online search stage. The data indexing stage processes, embeds, and upserts text and images into MongoDB, while the search stage handles search requests in real-time.

Data Indexing Stage

This stage involves:

  • Metadata enrichment using multi-modal LLMs to generate product descriptions and metadata fields.

  • Embedding generation for product names and descriptions.

  • Data ingestion into MongoDB Atlas Vector Search.

Search Stage

The search stage combines legacy text matching with advanced semantic search. It involves:

  1. Sending a search request from the frontend.

  2. Processing the request at the ingress deployment.

  3. Generating embeddings for the query text.

  4. Performing a vector search on MongoDB.

  5. Returning the search results to the frontend.

Conclusion

The collaboration between Anyscale and MongoDB represents a significant advancement in multi-modal search technology. By integrating advanced AI models and scalable data indexing pipelines, enterprises can now offer a more relevant and efficient search experience. This solution is particularly beneficial for e-commerce platforms looking to improve their search capabilities and user experience.

For more information, visit the Anyscale blog.

Image source: Shutterstock
RECENT NEWS

Ether Surges 16% Amid Speculation Of US ETF Approval

New York, USA – Ether, the second-largest cryptocurrency by market capitalization, experienced a significant surge of ... Read more

BlackRock And The Institutional Embrace Of Bitcoin

BlackRock’s strategic shift towards becoming the world’s largest Bitcoin fund marks a pivotal moment in the financia... Read more

Robinhood Faces Regulatory Scrutiny: SEC Threatens Lawsuit Over Crypto Business

Robinhood, the prominent retail brokerage platform, finds itself in the regulatory spotlight as the Securities and Excha... Read more

Ethereum Lags Behind Bitcoin But Is Expected To Reach $14K, Boosting RCOF To New High

Ethereum struggles to keep up with Bitcoin, but experts predict a rise to $14K, driving RCOF to new highs with AI tools.... Read more

Ripple Mints Another $10.5M RLUSD, Launch This Month?

Ripple has made notable progress in the rollout of its stablecoin, RLUSD, with a recent minting of 10.5… Read more

Bitcoin Miner MARA Acquires Another $551M BTC, Whats Next?

Bitcoin mining firm Marathon Digital Holdings (MARA) has announced a significant milestone in its BTC acquisition strate... Read more