AshwinSant5/GeoNet

GeoNet

GeoNet is a deep learning model that geolocates images from visual cues. It uses a hybrid architecture that combines Vision Transformers (ViT) and Convolutional Neural Networks (CNNs), so it can extract both local and global features from an image. Given an input image, the model predicts the geographic location where the image was captured, which makes it suitable for applications such as geolocation-based games, mapping, and research.
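To make the idea of combining local and global features concrete, here is a toy sketch in plain Python. It is not GeoNet's implementation: the fixed Sobel kernels, the single attention layer with identity projections, the patch size, the random linear head, and all function names (`local_features`, `global_features`, `predict_lat_lon`) are assumptions chosen for illustration. A real model would learn these weights and would normally be written with a deep learning framework.

```python
import math
import random

# Fixed 3x3 edge kernels stand in for learned convolution filters (assumption).
SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]


def local_features(image, kernels=(SOBEL_X, SOBEL_Y)):
    """CNN-like branch: 3x3 convolution, ReLU, then global average pooling."""
    h, w = len(image), len(image[0])
    feats = []
    for k in kernels:
        total = 0.0
        for i in range(h - 2):
            for j in range(w - 2):
                resp = sum(k[a][b] * image[i + a][j + b]
                           for a in range(3) for b in range(3))
                total += max(resp, 0.0)  # ReLU
        feats.append(total / ((h - 2) * (w - 2)))
    return feats


def global_features(image, patch=4):
    """ViT-like branch: patch tokens, one self-attention layer, mean pooling."""
    h, w = len(image), len(image[0])
    # Flatten each non-overlapping patch into one token.
    tokens = [[image[i + a][j + b] for a in range(patch) for b in range(patch)]
              for i in range(0, h - patch + 1, patch)
              for j in range(0, w - patch + 1, patch)]
    dim = len(tokens[0])
    attended = []
    for q in tokens:
        # Scaled dot-product attention with identity Q/K/V projections.
        scores = [sum(x * y for x, y in zip(q, k)) / math.sqrt(dim)
                  for k in tokens]
        m = max(scores)
        weights = [math.exp(s - m) for s in scores]
        z = sum(weights)
        attended.append([sum(wt / z * v[d] for wt, v in zip(weights, tokens))
                         for d in range(dim)])
    # Mean-pool the attended tokens into one global descriptor.
    return [sum(t[d] for t in attended) / len(attended) for d in range(dim)]


def predict_lat_lon(image, weights, bias):
    """Concatenate both branches and map a linear head to valid coordinates."""
    fused = local_features(image) + global_features(image)
    raw = [sum(w * x for w, x in zip(row, fused)) + b
           for row, b in zip(weights, bias)]
    # tanh keeps latitude in [-90, 90] and longitude in [-180, 180].
    return 90.0 * math.tanh(raw[0]), 180.0 * math.tanh(raw[1])


rng = random.Random(0)
image = [[rng.random() for _ in range(8)] for _ in range(8)]  # 8x8 grayscale
n_features = 2 + 16  # two conv features plus one 4x4 patch descriptor
weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_features)]
           for _ in range(2)]
lat, lon = predict_lat_lon(image, weights, [0.0, 0.0])
```

The convolution branch only sees 3x3 neighbourhoods, while every attention output mixes information from all patches. Concatenating the two vectors before the prediction head is one simple way to give the head both kinds of evidence.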

Features

Hybrid Architecture: Utilizes both Vision Transformers (ViT) and Convolutional Neural Networks (CNNs) to capture global and local image features for more robust geolocation predictions.

Accurate Image Geolocation: Predicts geographic coordinates (latitude and longitude) based on visual information within the input image.

Extensive Training: The model has been trained on a diverse dataset of geotagged images sourced from the Google Street View API, enhancing its ability to generalize across different landscapes, environments, and urban structures.

Applications: Useful in location-based services, geolocation games (e.g., GeoGuessr), mapping applications, and research that requires spatial analysis of image data.
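Because the model outputs latitude and longitude, a natural way to judge a prediction is the great-circle distance between the predicted and true coordinates. The sketch below uses the standard haversine formula. The function name `haversine_km`, the choice of mean Earth radius, and the example coordinates are assumptions for illustration; the README does not state which metric GeoNet was evaluated with.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius; a spherical approximation


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    # Clamp to 1.0 to guard against rounding slightly above the asin domain.
    return 2 * EARTH_RADIUS_KM * math.asin(min(1.0, math.sqrt(a)))


# Hypothetical prediction vs. ground truth (roughly Paris vs. London).
error_km = haversine_km(48.8566, 2.3522, 51.5074, -0.1278)
```

Unlike a plain Euclidean difference of the two coordinate pairs, this distance handles the wraparound at the 180° meridian correctly: points at longitudes 179 and -179 on the equator are about 222 km apart, not 358 degrees apart.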

Video Presentation and Demo

https://www.youtube.com/watch?v=hvCGrNylYic

Collaborators

Ashwin Santhosh

Alex Guo

Daniel Rolfe

Nolan Young

License & Copyright

This project is subject to copyright.

© 2024 Ashwin Santhosh, Alex Guo, Daniel Rolfe, Nolan Young. All rights reserved.

About

Deep Learning model capable of image geolocation and autonomously playing the famous geolocation game GeoGuessr.
