The encoded tokens from the test look like this:
And the test target looks like this:
But even after changing the test target to the actually encoded tokens, it still crashes, so I still need to investigate. In any case, I have already verified that the DeepSeek models work with the latest Jinja.
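A comparison like the one described above can be narrowed down by locating the first position where the encoded tokens and the test target diverge. The following is only a minimal sketch: `firstMismatch` is a hypothetical helper, not part of the library, and the token IDs are made-up placeholders rather than values from the actual test.

```swift
// Hypothetical helper (not part of the library): returns the index of the
// first position where two token ID sequences differ, or nil if they match.
func firstMismatch(_ encoded: [Int], _ expected: [Int]) -> Int? {
    for (index, pair) in zip(encoded, expected).enumerated() where pair.0 != pair.1 {
        return index
    }
    // The common prefix is identical, so the sequences differ only if one is longer.
    return encoded.count == expected.count ? nil : min(encoded.count, expected.count)
}

// Placeholder IDs for illustration only.
let encodedTokens = [1, 15, 27, 42, 2]
let targetTokens = [1, 15, 99, 42, 2]

if let index = firstMismatch(encodedTokens, targetTokens) {
    print("First mismatch at index \(index)")
} else {
    print("Sequences are identical")
}
```

Reporting only the first differing index keeps the output short when the sequences are long, which makes it easier to check whether the difference comes from a special token at the start or end of the prompt.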
The test targets were obtained from the Python tokenizer; they correspond to
Looking into it.
As you can see here, the prompt from the test is being encoded correctly, and there are no problems interacting with the model, but as soon as you call
@pcuenca, since you're more familiar with the library's internals than I am, perhaps you have a better intuition about how to approach the solution. My initial attempts at solutions with Sonnet and the entire library as context were unsuccessful.
Serialized AddedToken class partially supported (in addition to String values)
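For context, Hugging Face tokenizer configuration files may store a special token either as a plain string or as a serialized `AddedToken` object with a `content` field. The sketch below shows one way to accept both forms with `Decodable`. It is not the library's actual implementation: `TokenValue` is a hypothetical type, the JSON is a made-up sample, and only the `content` field is read, which is an assumption about what "partially supported" covers.

```swift
import Foundation

// Hypothetical type for illustration; not the library's actual API.
enum TokenValue: Decodable, Equatable {
    case plain(String)           // e.g. "bos_token": "<s>"
    case added(content: String)  // e.g. "bos_token": {"__type": "AddedToken", "content": "<s>", ...}

    private enum CodingKeys: String, CodingKey {
        case content
    }

    init(from decoder: Decoder) throws {
        // First try the simple form: a bare JSON string.
        if let single = try? decoder.singleValueContainer(),
           let string = try? single.decode(String.self) {
            self = .plain(string)
            return
        }
        // Otherwise expect an object and read only its "content" field.
        let container = try decoder.container(keyedBy: CodingKeys.self)
        self = .added(content: try container.decode(String.self, forKey: .content))
    }

    // The token text, regardless of how it was serialized.
    var content: String {
        switch self {
        case .plain(let value): return value
        case .added(let value): return value
        }
    }
}

// Made-up sample resembling a tokenizer configuration.
let json = """
{"bos_token": "<s>", "eos_token": {"__type": "AddedToken", "content": "</s>", "lstrip": false}}
""".data(using: .utf8)!

let tokens = try JSONDecoder().decode([String: TokenValue].self, from: json)
print(tokens["bos_token"]!.content)
print(tokens["eos_token"]!.content)
```

Other `AddedToken` fields such as `lstrip` are ignored in this sketch; a fuller version would decode them as well.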
Package.swift
  dependencies: [
      .package(url: "https://github.com/apple/swift-argument-parser.git", from: "1.4.0"),
-     .package(url: "https://github.com/maiqingqiang/Jinja", from: "1.0.6")
+     .package(url: "https://github.com/johnmai-dev/Jinja", from: "1.1.0")
I'm going to move this and the chat template test to a new PR, since the rest of the fixes here are more general and unrelated to the new Jinja engine.
Merging this. As explained, the Jinja upgrade will follow shortly in a separate PR, since the changes here are general.
The new test does not pass for some reason. Is there anything you think I'm doing wrong, @DePasqualeOrg?