Just one second...

AdminLTELogo

Read me!

This is a demo web app.
It does not reflect the complete functionality of the project.
Currently the registration for new users is turned off.

Project capabilities

Data Management

  • Automatic data import from AWS S3.
  • Upload of data into datasets
  • Simple and Fast creation of image embeddings
  • Data Indexing in high dimensional vector space for fast similarity search

Collection Querying / Similarity Search

  • Query Collections with images from the database or by uploading a file
  • Fast query times for similarity search
  • Finds duplicates and similar images in a few milliseconds

Data visualization

  • Comprehensive exploration of image embeddings through an interactive visualization
  • Find clusters of similar images
  • Analyze the dataset interactively by navigating a 2D-Vector Space
  • Query images and see where similar images are located in the 2D-Vector Space

Demo Restrictions

What is possible?

  • Exploration of public Collection "Open Images"
  • Query and Explore the collection by either uploading your own query images
    or navigating through the interactive visualization

What is not possible?

  • User Registration disabled
  • Dataset and Collection Creation disabled
  • Import of new data disabled

Tech Stack

Frontend

  • React
  • Typescript
  • deck.gl for visualization
  • Auth0 for Authentication

Backend - Web Application / Rest Server

  • NodeJS / ExpressJS
  • MongoDB

Workflow Orchestration

  • Apache Airflow
  • Python

Vector Embeddings and Data Analysis

  • Python
  • Tensorflow, Image Classification Models, Embedding Models
  • VGG16 Vector Embeddings
  • Milvus Vector Database
  • RabbitMQ - Message Queue
  • T-SNE, UMAP

Cloud

  • AWS S3
  • AWS Batch
  • AWS EC2