Image search
Image search helps you find similar images by comparing their visual content, not just text or metadata. This feature is useful for e-commerce, content moderation, digital asset management, and any scenario where you need to search for or deduplicate images based on appearance.
TiDB enables image search using vector search. With automatic embedding, you can generate image embeddings from image URLs, PIL images, or keyword text using a multimodal embedding model. TiDB then efficiently searches for similar vectors at scale.
Tip
For a complete example of image search, see the Pet image search demo.
Basic usage
Step 1. Define an embedding function
To generate image embeddings, you need an embedding model that supports image input.
For demonstration, you can use Jina AI's multimodal embedding model to generate image embeddings.
Go to Jina AI to create an API key, then initialize the embedding function as follows:
```python
from pytidb.embeddings import EmbeddingFunction

image_embed = EmbeddingFunction(
    # Or another provider/model that supports multimodal input
    model_name="jina_ai/jina-embeddings-v4",
    api_key="{your-jina-api-key}",
)
```
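The following steps also assume a connected TiDB client named `client`. A minimal sketch, using placeholder connection details that you should replace with your own cluster credentials:

```python
from pytidb import TiDBClient

# Placeholder connection details; replace with your own TiDB cluster credentials.
client = TiDBClient.connect(
    host="{tidb-host}",
    port=4000,
    username="{tidb-username}",
    password="{tidb-password}",
    database="{tidb-database}",
)
```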
Step 2. Create a table and vector field
Use `VectorField()` to define a vector field for storing image embeddings. Set the `source_field` parameter to specify the field that stores the image URLs.
```python
from pytidb.schema import TableModel, Field

class ImageItem(TableModel):
    __tablename__ = "image_items"

    id: int = Field(primary_key=True)
    image_uri: str = Field()
    image_vec: list[float] = image_embed.VectorField(
        source_field="image_uri"
    )

table = client.create_table(schema=ImageItem, mode="overwrite")
```
Step 3. Insert image data
When you insert data, the `image_vec` field is automatically populated with the embedding generated from the `image_uri` field.
```python
table.bulk_insert([
    ImageItem(image_uri="https://example.com/image1.jpg"),
    ImageItem(image_uri="https://example.com/image2.jpg"),
    ImageItem(image_uri="https://example.com/image3.jpg"),
])
```
Step 4. Perform image search
Image search is a type of vector search. Automatic embedding lets you input an image URL, PIL image, or keyword text directly. All these inputs are converted to vector embeddings for similarity matching.
Option 1: Search by image URL
Search for similar images by providing an image URL:
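For example, pass the URL directly to `table.search()` (the query URL below is a placeholder):

```python
# The query URL is a placeholder; use any image URL the embedding provider can fetch.
results = table.search("https://example.com/query.jpg").limit(3).to_list()
```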
The client converts the input image URL into a vector. TiDB then finds and returns the most similar images by comparing their vectors.
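To inspect the matches, you can iterate over the returned rows. This is a rough sketch; the exact row format depends on your pytidb version:

```python
# Print the matched image records; each row corresponds to an ImageItem.
for item in results:
    print(item)
```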
Option 2: Search by PIL image
You can also search for similar images by providing a local image file loaded as a PIL image:
```python
from PIL import Image

image = Image.open("/path/to/query.jpg")
results = table.search(image).limit(3).to_list()
```
The client converts the PIL image object into a Base64 string before sending it to the embedding model.
Option 3: Search by keyword text
You can also search for similar images by providing keyword text.
For example, if you are working on a pet image dataset, you can search for similar images by keywords like "orange tabby cat" or "golden retriever puppy".
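A minimal sketch following the same search pattern as the previous options (the keyword is just an example query):

```python
# The keyword is an example query; any descriptive text works.
results = table.search("orange tabby cat").limit(3).to_list()
```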
The multimodal embedding model converts the keyword text into a vector embedding that captures its semantic meaning. A vector search then finds the images whose embeddings are most similar to the keyword embedding.