As memories pile up in our camera rolls or remain abstract ideas, we wanted to create a way to truly experience them. MemoryMake was born from the idea of transforming ordinary photos and text prompts into dynamic, navigable, and visually engaging 3D environments—so users can explore their cherished moments instead of merely scrolling past them.
We developed our own efficient 3D model generation pipeline, which takes in a panorama image and returns a 3D model that can be viewed in a 180-degree environment, leveraging cutting-edge computer vision and generative AI research.
In a society where everyone is constantly looking forward, we wanted an opportunity to appreciate our memories and live in them once again.
This project was submitted to QHacks 2025; learn more on our Devpost.
View the demo on YouTube
- 3D Model Generation: Our backend uses MiDaS monocular depth estimation, developed at Intel Labs, combined with Open3D mesh generation and our custom algorithm that converts the point cloud into cylindrical coordinates to recreate a 180-degree view of the scene (see the sketch after this list).
- 180-Degree View: The 3D model is returned to the frontend, where you can zoom, pan, and navigate around your memory using Three.js. You can easily browse through collections and move around inside each 3D environment.
- Style Selection: Choose from artistic styles such as Photorealistic, Monet, or Van Gogh; Neural Style Transfer overlays the chosen style onto the real photo you upload.
- Text-to-3D: Don't have a panorama? You can also enter a text prompt, which is turned into an image with Stable Diffusion and then converted into a 3D environment.
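To make the backend pipeline concrete, here is a minimal sketch of the panorama-to-mesh flow. It assumes MiDaS is loaded through `torch.hub` and Open3D handles meshing; the function name, depth-to-radius scaling, and Poisson meshing parameters are illustrative assumptions, not the exact algorithm used in this repo.

```python
# Rough sketch: panorama -> MiDaS depth -> cylindrical point cloud -> Open3D mesh.
# The depth-to-radius scaling and 180-degree cylindrical mapping below are
# simplified assumptions; the actual custom algorithm may differ.
import cv2
import numpy as np
import open3d as o3d
import torch

def panorama_to_mesh(image_path: str, mesh_path: str = "scene.obj") -> None:
    # 1. Monocular depth estimation with MiDaS (small model for speed).
    midas = torch.hub.load("intel-isl/MiDaS", "MiDaS_small")
    midas.eval()
    transform = torch.hub.load("intel-isl/MiDaS", "transforms").small_transform

    img = cv2.cvtColor(cv2.imread(image_path), cv2.COLOR_BGR2RGB)
    with torch.no_grad():
        pred = midas(transform(img))
        depth = torch.nn.functional.interpolate(
            pred.unsqueeze(1), size=img.shape[:2], mode="bicubic", align_corners=False
        ).squeeze().numpy()

    # MiDaS predicts relative inverse depth (larger = closer), so map it to an
    # assumed radius range where far pixels end up farther from the camera.
    radius = 1.0 + 4.0 * (depth.max() - depth) / (depth.max() - depth.min() + 1e-6)

    # 2. Cylindrical mapping: columns -> azimuth over 180 degrees, rows -> height.
    h, w = depth.shape
    theta = np.linspace(-np.pi / 2, np.pi / 2, w)   # azimuth per column
    height = np.linspace(1.0, -1.0, h)[:, None]     # vertical position per row
    x = radius * np.sin(theta)[None, :]
    z = radius * np.cos(theta)[None, :]
    y = height * radius                             # crude vertical scaling

    points = np.stack([x, y, z], axis=-1).reshape(-1, 3)
    colors = img.reshape(-1, 3) / 255.0

    # 3. Build a colored point cloud and mesh it with Open3D.
    pcd = o3d.geometry.PointCloud()
    pcd.points = o3d.utility.Vector3dVector(points)
    pcd.colors = o3d.utility.Vector3dVector(colors)
    pcd.estimate_normals()
    mesh, _ = o3d.geometry.TriangleMesh.create_from_point_cloud_poisson(pcd, depth=8)
    o3d.io.write_triangle_mesh(mesh_path, mesh)
```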
The following diagram shows the workflow of the project:
- MiDaS: https://arxiv.org/pdf/1907.01341v3
- Neural Style Transfer: https://arxiv.org/pdf/1508.06576
- Stable Diffusion: https://arxiv.org/pdf/2112.10752
- CycleGAN: https://arxiv.org/pdf/1703.10593
Start the React frontend:

```bash
cd frontend
npm install
npm run dev
```

Start the FastAPI backend:

```bash
cd backend
uvicorn main:app --reload
```

For image generation, you will need a Hugging Face Inference API key in a `.env` file in the backend directory. However, since the model is open source, you can also modify the code to download the weights and run it locally. Simply uncomment the lines in `backend/main.py` that use the local code path:

```python
stable_diffusion.generate_image_local(prompt, style, save_image_path)
```

All the models used in this project are open source and free to use.
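For reference, here is a rough sketch of what the two generation paths might look like. The hosted path calls the Hugging Face Inference API, the local path uses the open-source weights via the diffusers library; the `HF_API_TOKEN` variable name, the model ID, and the prompt/style composition are illustrative assumptions, so check `backend/stable_diffusion.py` for the actual implementation.

```python
# Illustrative sketch of hosted vs. local Stable Diffusion generation.
# Names and model IDs are assumptions, not the repo's exact code.
import os
from huggingface_hub import InferenceClient

def generate_image_api(prompt: str, style: str, save_image_path: str) -> None:
    # Hosted inference: requires an API token (here assumed to be HF_API_TOKEN in .env).
    client = InferenceClient(token=os.environ["HF_API_TOKEN"])
    image = client.text_to_image(
        f"{prompt}, in a {style} style",            # illustrative prompt composition
        model="stabilityai/stable-diffusion-2-1",   # assumed model ID
    )
    image.save(save_image_path)

def generate_image_local(prompt: str, style: str, save_image_path: str) -> None:
    # Local inference: downloads the open-source weights once, then runs offline.
    import torch
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained(
        "stabilityai/stable-diffusion-2-1", torch_dtype=torch.float16
    ).to("cuda")
    image = pipe(f"{prompt}, in a {style} style").images[0]
    image.save(save_image_path)
```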
- Enhance the prompt used to generate images with Stable Diffusion so the output includes more pronounced 3D features
- Improve the 3D generation pipeline to minimize distortion and improve the overall quality of the generated 3D models
- Render the 3D models in a VR headset!
- Implement CycleGAN to allow users to change the style of the generated 3D models (beyond adding artistic styles)




