Allow the `llama_progress_callback` to abort model loading early without having to throw an exception #4551

# Prerequisites

Please answer the following questions for yourself before submitting an issue.

- [x] I am running the latest code. Development is very rapid so there are no tagged versions as of now.
- [x] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md).
- [x] I [searched using keywords relevant to my issue](https://docs.github.com/en/issues/tracking-your-work-with-issues/filtering-and-searching-issues-and-pull-requests) to make sure that I am creating a new issue that is not already open (or closed).
- [x] I reviewed the [Discussions](https://github.com/ggerganov/llama.cpp/discussions), and have a new bug or useful enhancement to share.

# Feature Description

Allow the `llama_progress_callback` to return a value that will stop the model being loaded, and free all resources.

# Motivation

LLMs can brush up against the limits of some computers, and sometimes you just need an emergency stop button. llama.cpp can already catch a `std::exception` thrown during model loading and clean up the half-loaded model. Unfortunately, callbacks written in non-C++ languages (such as Rust) can't throw a `std::exception`, so even if they do unwind, llama.cpp's try-catch won't catch it and the resources used by the model won't be cleaned up properly.

# Possible Implementation

Allow the `llama_progress_callback` to return a value that aborts model loading early. Maybe have it return a `bool`, where `true` means continue and `false` means abort? This could bite existing codebases, though: the change is subtle, and callbacks written without a return value could break without any obvious error.
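
To make the idea concrete, here is a rough sketch of what a `bool`-returning callback could look like. Everything in it is hypothetical (`progress_callback_sketch`, `model_sketch`, `load_model_sketch`, `cancel_state`, `abort_past_threshold`). None of these names are real llama.cpp symbols, and the actual loader would be far more involved.

```cpp
#include <cassert>
#include <memory>
#include <vector>

// Hypothetical signature: return true to continue loading, false to abort.
// This is the shape proposed in this issue, not an existing llama.cpp typedef.
typedef bool (*progress_callback_sketch)(float progress, void * user_data);

// Stand-in for a partially loaded model (a real one would own mmaps, buffers, etc.).
struct model_sketch {
    std::vector<int> tensors;
};

// Hypothetical loader: returns nullptr if the callback asks to abort.
// The unique_ptr frees the half-loaded model on the early return,
// so no exception has to cross the language boundary.
model_sketch * load_model_sketch(int n_tensors, progress_callback_sketch cb, void * user_data) {
    assert(n_tensors > 0);
    std::unique_ptr<model_sketch> model(new model_sketch());
    for (int i = 0; i < n_tensors; ++i) {
        model->tensors.push_back(i); // pretend to load one tensor
        const float progress = float(i + 1) / float(n_tensors);
        if (cb != nullptr && !cb(progress, user_data)) {
            return nullptr; // abort requested: partial model is released here
        }
    }
    return model.release(); // success: caller takes ownership
}

// Example caller state and callback: abort once progress reaches a threshold.
struct cancel_state {
    float abort_after; // stop when progress >= this value
    int   calls;       // how many times the callback ran
};

bool abort_past_threshold(float progress, void * user_data) {
    cancel_state * s = static_cast<cancel_state *>(user_data);
    s->calls++;
    return progress < s->abort_after;
}
```

With a shape like this, a Rust (or any other FFI) caller would only need to return `false` from its callback to cancel the load, and the cleanup would stay entirely on the C++ side.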
