Build a deep learning-powered image search system with Jina + BigTransfer (BiT)

Learn step by step how to build a neural image search system from scratch with open source tools.

Cristian Mitroi, Maximilian Werk

Computer Vision Data Science Deep Learning Machine-Learning

See in schedule: Mon, Jul 26, 13:15-14:45 CEST (90 min)

From scientific applications to consumer software to internal systems in the enterprises, deep learning technology is transforming how we interact with and make use of all the data. Especially with the rapid growth of media and metadata in these sectors, there is an evolving need for search systems to go beyond the conventional search approach (symbolic search) and towards semantic search, or neural search.

Deep learning technology provides a good base to semantically search for information. However, building a neural search system is non-trivial for researchers and engineers without good understanding of both search workflows and deep learning models.

In this workshop, we will show you step by step how you can make use of open source tools (Jina + BigTransfer model) to build a neural search system that performs image-to-image search with an open source dataset, in an easier way.

Jina is an open source neural search framework that empowers software developers to utilize deep learning models to build search systems that are fast, scalable, and work with any kind of data.

Development environment: WSL 2 (Windows) / Linux / MacOS with Python 3.7

NOTE: You will need to pre-install the requirements, download a dataset, and a ML model. Make sure to download this repo and follow the instructions in the README: https://github.com/jina-ai/workshops/tree/main/pokedex#before-the-workshop

Alternatively, you can follow along with the the notebook in Google Colaboratory: https://colab.research.google.com/github/jina-ai/workshops/blob/main/pokedex/workshop.ipynb

Agenda:
Intro to Neural Search
Environment Setup
Jina Basics
Advanced configuration of Jina (replacing model, replacing preprocessing pipeline etc.)
Pipeline hyperparameter optimization

Type: Training (180 mins); Python level: Beginner; Domain level: Intermediate


Cristian Mitroi

Jina AI

https://github.com/cristianmtr
https://www.linkedin.com/in/cristianmitroi/

I'm a Machine Learning Engineer with a practical mindset and a passion for the field.
I work at Jina AI, in Berlin, where I am making neural search open source and universal.
I have six years of professional experience across the stack (Machine Learning, Backend, DevOps, Frontend).

Maximilian Werk

Jina AI

After studying mathematics I work since 7 years as a ML Engineer. Currently, I help bringing the vision of an open-source Neural Search framework into reality at Jina AI. While bringing machine learning into production is awesome, I strongly focus on the engineering aspect.

I fell heavily in love with the concept of best practices and the design principles in Python. Breaking complex problems into not-so-complex sub-problems and finding an elegant solution design is my passion.