Introduction

You have a choice in the type of content which you want to generate:

 

Images (Computer Vision)
  • Using a pre-trained HuggingFace stable diffusion model to generate artistic images from input image & a prompt, we want to further train it on a niche dataset (in our case containing images of cars in different locations) so that we can then generate high quality car images with artistic styles through a text prompt.
  • We then want to use our generated car images for two further use cases:
    • To replace the car in the image with a different one, using a generated mask and another stable diffusion model.
    • To better the image quality with the stable diffusion upscaling model.

 

Text (Natural Language Processing)
  • Using a pre-trained HuggingFace text generation model, we want to be able to generate a set of textual data using simple, intuitive input prompts. The output can be evaluated based on the believability of the output. Here are some examples of the things which can be generated:
  1. A news article based around a specific topic, on current events (e.g. space)
  2. An imaginary product based on reviews or product specifications (e.g. amazon reviews or product data)
  3. Synthetic Data which can be used to improve an existing machine learning algorithm on a specific use-case. 

 

Tools and Technologies

Development Environment

Google Colab Model Platform

 

APIs

HuggingFace, OpenAI

 

Models

Computer Vision

Stable Diffusion

 

Natural Language Processing

GPT-3 (Open-AI) GPT-Neo/NeoX (2.7B and 20B respectively) - The models to be used for the Natural Language Processing area of the challenge are relatively flexible. Feel free to use open-source large language models which can be found on the HuggingFace Platform.

 

Inspiration

Useful resources and example notebooks can be found at this Google Drive link.
https://drive.google.com/drive/u/1/folders/1R4boDg9kBaxKKLLbnuGQKszeGuKgPSs1

 

GPT-3 Models
https://platform.openai.com/docs/models

NeoX (20B)
https://huggingface.co/EleutherAI/gpt-neox-20b

Neo(2.7B)
https://huggingface.co/EleutherAI/gpt-neo-2.7B

General stable diffusion model
https://huggingface.co/docs/diffusers/usingdiffusers/conditional_image_generation

Masked stable diffusion model
https://huggingface.co/docs/diffusers/usingdiffusers/inpaint

Depth map stable diffusion model
https://huggingface.co/docs/diffusers/usingdiffusers/depth2img

Upscaling stable diffusion model
https://huggingface.co/docs/diffusers/v0.13.0/en/api/pipelines/stable_diffusion/upscale

DL mask generation model
https://github.com/isl-org/MiDaS

 

Sponsors - REPLY

Website: https://www.reply.com/it/about/careers/HomePage
Instagram: https://www.instagram.com/replyulabs/
Facebook: https://www.facebook.com/ReplyUlabs
Twitter: https://twitter.com/ReplyULabs
Youtube: https://www.youtube.com/channel/UCKaTaWyDxiunGFoaIKk_8wQ
LinkedIn: https://www.linkedin.com/company/reply/life?trk=nav_type_life
Reply Ambassador: https://www.reply.com/it/about/ambassadors
Reply Careers: https://www.reply.com/it/about/careers/HomePage

 

Contact Us & Support Channels

Website: https://hackcambridge.com
Email: mailto:team@hackcambridge.com
Facebook: https://www.facebook.com/hackcambridge
Twitter: https://twitter.com/Hack_Cambridge/
Instagram: https://www.instagram.com/hack_cambridge/
LinkedIn: https://www.linkedin.com/company/hack-cambridge