TiefVision Is a Deep-Learning Image Search Engine

TiefVision is an end-to-end deep learning image-similarity search engine.

TiefVision is an integrated end-to-end image-based search engine based on deep learning. It covers image classification, image location (OverFeat) and image similarity (Deep Ranking).

TiefVision is implemented in Torch and Play Framework (Scala version). It currently only supports Linux with CUDA-enabled GPU.

The project is divided into two module groups: Deep Learning Modules and Tooling Modules.

Deep Learning Modules

The deep learning modules included in TiefVision are the following:

1- Transfer Learning

TiefVision transfers a simplified (without grouping) AlexNet network that is used for encoding purposes. The steps involved in the transfer learning phase are the following:

It splits an already trained AlexNet neural network without grouping into two neural networks:
The lower convolutional part that acts as an encoder of high-level features (“image encoder")
The upper/top fully connected part that is discarded as it’s meant to classify images for other purposes (ImageNet classification).
It reduces the last max pooling step size from the encoder neural network (lower-part) to increase the spatial accuracy.

2- Image Classification

The image classification module performs the following steps:

It encodes all the crops from the target image (e.g. dresses) and its background using the encoder neural network:
Target Image Crops: crops of the images in such a way at least 50% of the crop is inside the target image bounding box. For a dataset of dresses, at least 50% of the crop contains a dress (it can include up to 50% of the background).
Background Image Crops: crops of the images in such a way at least 50% of the crop contains the target image background. For the example of dresses, at least 50% of the crop contains background.
It trains a fully connected neural network to classify the target image crops (e.g. dresses) and its background crops (e.g. photo studio background).

3- Image Location (based on OverFeat)

The image location module perform the following steps:

It encodes the Target Image Crop's dataset together with its normalized bounding box delta (distance between the bounding box upper-left point and the bounding box coordinates).
It trains four fully connected neural networks to predict the two relative bounding box points:
Two neural networks for the two dimensions of the upper-left point.
Two neural networks for the two dimensions of the lower-right point.
It extracts the bounding box filtering out background crops using the image classification neural network and averaging the bounding boxes using the bounding box neural network.

4- Image Similarity (based on Deep Ranking)

The similarity is based on the distance between two image encodings. TiefVision trains a neural network to map encoded images into a space in which the dot product acts as a similarity distance between images. As the encodings are normalized, the dot product computes the cosine of the angle between the encodings.

Given the following triplets of images:

H: a reference image
H+: an image similar to the reference image (H).
H-: another image that is similar to H but not as similar as H+.

It trains a neural network to make H+ closer to H than H- using the Hinge loss: l(H, H+, H-) = max(0, margin + D(H, H+) - D(H, H-)) where D is the dot product of the two images mapped into the neural network’s output space: D(H1, H2) = NN(H1) · NN(H2)

Tooling Modules

TiefVision includes a set of web tools to ease the generation of datasets and thus increase productivity.

The current tools are the following:

Visual Bounding Box Database Editor
Visual Similarity Database Editor
Web File-based Image Search
Image Gallery Browser (search upon click)
Automated Train and Test dataset generation for:
Image (crop) classification
Bounding box regression
Image Similarity (Deep Ranking)

License and Copyright

Resources

Source code

Introducing Jan: A Powerful Open-Source Alternative to ChatGPT for Your Desktop and Docker

What is Jan? Are you in search of a reliable, open-source alternative to ChatGPT? Look no further! We introduce you to Jan, a powerful AI chatbot that runs 100% offline on your computer. Unlike many other AI-powered chatbots, Jan offers you complete privacy and security as it operates entirely offline.

INCEpTION is an open-source Semantic Annotation

What is INCEpTION? INCEpTION is a sophisticated semantic annotation platform, diligently developed by the UKP Lab at the esteemed Technical University of Darmstadt. Its primary objective is to centralize a diverse range of semantic annotation tasks into a single, user-friendly web-based platform. This innovative open-source free platform revolutionizes the annotation

A Look at 9 Free and Open-source Face Detection and Recognition Libraries

In this comprehensive blog post, we will be diving deep into the enthralling and rapidly evolving world of face detection and recognition libraries. These advanced and powerful tools are revolutionizing the way we interact with our digital surroundings. By offering more secure means of authentication, they significantly improve the security

22 News Aggregator Apps for Timely Updates: Free Options with AI Support

What is a News Aggregator A news aggregator is a tool or platform that collects news from various sources and presents them in one location. It is used for conveniently accessing and reading news from multiple sources without having to visit each source individually. In this list, we collected the

Autodistill: The Future of Efficient, AI-Powered Image Recognition

What is Autodistill? Autodistill is a free and open-source AI project that uses large, slower foundation models to train smaller, faster supervised models, allowing for inference on unlabeled images with no human intervention. It can be used on personal hardware or via Roboflow's hosted version for cloud-based image

Payload Wizard: The AI Assistant for Pentesters

What are cyber security payloads? In cybersecurity, a payload refers to the part of the malicious code that performs the harmful action; this could range from stealing data to damaging systems. For penetration testers, payloads are used in a controlled and ethical manner to probe system vulnerabilities and validate defenses.

Empower your Coding Experience in VSCode with this Amazing AI Copilot

What is Collama? Collama is a free and open-source VSCode AI coding assistant powered by self-hosted llama.cpp endpoint. Under the hood It uses LLaMA.cpp, the Inference of Meta's LLaMA model (and others) in pure C/C++. Llama.cpp is designed to facilitate LLM inference with minimal

Summarize Hacker News with AI: Discover the Open-Source Project Enhancing Article Digests

Hacker News Summary is an open-source project that uses AI technology, primarily the ChatGPT gpt-3.5-turbo model, to extract summaries and illustrations from Hacker News articles. If ChatGPT is unavailable, the local GoogleT5 model is used. The service provides clear and easily understandable summaries. Features * Clear and easily understandable summaries

Develop Advanced AI Bots Effortlessly with Bot Server's Comprehensive Tools

Bot Server is an open-source bot development platform that streamlines AI bot development by providing code base, resources, cloud deployment, and templates. It allows modifications via a downloadable zip file, and supports advanced development with custom code in various editors. It's GPT-powered, compatible with Bot Framework V4, and

Navigate Your Digital World with OneReality: The AI-Powered Multilingual Assistant

OneReality is an AI application that functions as a multilingual virtual assistant with features such as voice interaction, short and long-term memory, app control, and smart home integration. The AI uses SpeechRecognition and OpenAI's Whisper for speech-to-text transcription, and responses are generated using multiple checks and procedures. The

TiefVision Is a Deep-Learning Image Search Engine

Hazem Abbas

Deep Learning Modules

1- Transfer Learning

2- Image Classification

3- Image Location (based on OverFeat)

4- Image Similarity (based on Deep Ranking)

Tooling Modules

License and Copyright

Resources

Tags

Deep Learning Modules

1- Transfer Learning

2- Image Classification

3- Image Location (based on OverFeat)

4- Image Similarity (based on Deep Ranking)

Tooling Modules

License and Copyright

Resources

Tags

Related Articles in Artificial Intelligence