You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Picking up where epoch #79 left off. Triaged sprint:
Implement Responses API: misleading error on context overflow, must communicate token limit exceeded #19 fix: Responses API truncation status — verified last cycle that to_json_oaicompat_resp{,_stream} still hardcode "status": "completed" regardless of stop == STOP_TYPE_LIMIT. The minimum fix per the issue: branch on stop type, emit "status": "incomplete" + "incomplete_details": {"reason": "max_output_tokens"} per OAI Responses spec. Test via the existing pytest harness with low --n-predict + check the JSON shape.
CI workflow audit round 2 — last epoch covered the zero-job-on-master pattern (build-cann.yml → PR ci(cann): drop auto-triggers (workflow has zero jobs upstream) #80). Other patterns worth scanning: self-hosted runner workflows that may not have runners in our fork, workflows with default tokens that need org-scoped permissions.
Picking up where epoch #79 left off. Triaged sprint:
to_json_oaicompat_resp{,_stream}still hardcode"status": "completed"regardless ofstop == STOP_TYPE_LIMIT. The minimum fix per the issue: branch on stop type, emit"status": "incomplete"+"incomplete_details": {"reason": "max_output_tokens"}per OAI Responses spec. Test via the existing pytest harness with low--n-predict+ check the JSON shape.out_bytes_per_deviceparameter directly). Add a focused C++ unit test or extend the fit-params smoke to assert plan-vs-status-success invariant.tools/server/server-queue.cpp— round 1 found races in server-models.cpp (PR fix(server): guard two mapping[] accesses against silent default-insert #77), round 2 found UB in server-context.cpp create_checkpoint (folded into PR fix(server): per-slot byte cap on context checkpoints (closes #67) #68). Next adjacent file. Smaller than server-context.cpp; pattern continues to be lifetime + lock-scope.For later / saved:
Blocked on Markus / snoop-kube (unchanged):