Introduction
You have a choice in the type of content which you want to generate:
Images (Computer Vision)
- Using a pre-trained HuggingFace stable diffusion model to generate artistic images from input image & a prompt, we want to further train it on a niche dataset (in our case containing images of cars in different locations) so that we can then generate high quality car images with artistic styles through a text prompt.
- We then want to use our generated car images for two further use cases:
- To replace the car in the image with a different one, using a generated mask and another stable diffusion model.
- To better the image quality with the stable diffusion upscaling model.
Text (Natural Language Processing)
- Using a pre-trained HuggingFace text generation model, we want to be able to generate a set of textual data using simple, intuitive input prompts. The output can be evaluated based on the believability of the output. Here are some examples of the things which can be generated:
- A news article based around a specific topic, on current events (e.g. space)
- An imaginary product based on reviews or product specifications (e.g. amazon reviews or product data)
- Synthetic Data which can be used to improve an existing machine learning algorithm on a specific use-case.
Tools and Technologies
Development Environment
Google Colab Model Platform
APIs
HuggingFace, OpenAI
Models
Computer Vision
Stable Diffusion
Natural Language Processing
GPT-3 (Open-AI) GPT-Neo/NeoX (2.7B and 20B respectively) - The models to be used for the Natural Language Processing area of the challenge are relatively flexible. Feel free to use open-source large language models which can be found on the HuggingFace Platform.
Inspiration
Useful resources and example notebooks can be found at this Google Drive link.
https://drive.google.com/drive/u/1/folders/1R4boDg9kBaxKKLLbnuGQKszeGuKgPSs1
Additional Resource links
GPT-3 Models
https://platform.openai.com/docs/models
NeoX (20B)
https://huggingface.co/EleutherAI/gpt-neox-20b
Neo(2.7B)
https://huggingface.co/EleutherAI/gpt-neo-2.7B
General stable diffusion model
https://huggingface.co/docs/diffusers/usingdiffusers/conditional_image_generation
Masked stable diffusion model
https://huggingface.co/docs/diffusers/usingdiffusers/inpaint
Depth map stable diffusion model
https://huggingface.co/docs/diffusers/usingdiffusers/depth2img
Upscaling stable diffusion model
https://huggingface.co/docs/diffusers/v0.13.0/en/api/pipelines/stable_diffusion/upscale
DL mask generation model
https://github.com/isl-org/MiDaS
Sponsors - REPLY
Website: https://www.reply.com/it/about/careers/HomePage
Instagram: https://www.instagram.com/replyulabs/
Facebook: https://www.facebook.com/ReplyUlabs
Twitter: https://twitter.com/ReplyULabs
Youtube: https://www.youtube.com/channel/UCKaTaWyDxiunGFoaIKk_8wQ
LinkedIn: https://www.linkedin.com/company/reply/life?trk=nav_type_life
Reply Ambassador: https://www.reply.com/it/about/ambassadors
Reply Careers: https://www.reply.com/it/about/careers/HomePage
Contact Us & Support Channels
Website: https://hackcambridge.com
Email: mailto:team@hackcambridge.com
Facebook: https://www.facebook.com/hackcambridge
Twitter: https://twitter.com/Hack_Cambridge/
Instagram: https://www.instagram.com/hack_cambridge/
LinkedIn: https://www.linkedin.com/company/hack-cambridge
