
llm factory update #76

Merged

yisz merged 1 commit into main from fix/llm-factory on Sep 2, 2024

Conversation

@yisz (Contributor) commented on Sep 2, 2024

Updated default eval model to gpt-4o-mini
Updated json method


🚀 This description was created by Ellipsis for commit e0d0a4d

Summary:

Updated continuous_eval/llm_factory.py to set gpt-4o-mini as default model, added JSON parsing method, and increased retry attempts for LLM responses.

Key points:

  • Updated DefaultLLM to use gpt-4o-mini as the default model in continuous_eval/llm_factory.py.
  • Added json method to LLMInterface and implemented it in LLMFactory to parse JSON from LLM output.
  • Increased retry attempts in LLMFactory._llm_response from 15 to 50.
  • Removed max_tokens parameter from LLMInterface.run method signature.
  • Adjusted CohereClient.generate call to use a fixed max_tokens value of 1024.

Generated with ❤️ by ellipsis.dev
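The JSON-parsing behavior summarized above can be sketched roughly as follows. This is a hedged sketch, not the actual code in continuous_eval/llm_factory.py: the function name extract_json is hypothetical, and only the "find the first brace, strip code fences, parse" logic is taken from the diff shown later in this review.

```python
import json

def extract_json(llm_output: str) -> dict:
    """Hypothetical sketch of the PR's JSON-parsing helper:
    find the first '{', strip surrounding code-fence backticks,
    then parse the remainder with the json module."""
    if "{" not in llm_output:
        raise ValueError("no JSON object found in LLM output")
    first_bracket = llm_output.index("{")
    candidate = llm_output[first_bracket:].strip("`").strip()
    return json.loads(candidate)

print(extract_json('```json\n{"score": 1}\n```'))
```

A real implementation would also need to handle outputs where the closing brace is followed by trailing prose, which this sketch does not attempt.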

@yisz yisz merged commit eb8384e into main on Sep 2, 2024

@ellipsis-dev ellipsis-dev bot left a comment


❌ Changes requested. Reviewed everything up to e0d0a4d in 24 seconds

More details
  • Looked at 78 lines of code in 1 file
  • Skipped 0 files when reviewing.
  • Skipped posting 1 drafted comments based on config settings.
1. continuous_eval/llm_factory.py:241
  • Draft comment:
    Backticks are used in strip method, which is incorrect. Use regular quotes instead.
        json_output = llm_output.strip("```").strip(" ").replace("json", "")
  • Reason this comment was not posted: Marked as duplicate.



Workflow ID: <workflowid>`wflow_lm2kNa2WcSNelVFV`</workflowid>



----
**Want Ellipsis to fix these issues?** Tag `@ellipsis-dev` in a comment. You can customize Ellipsis with :+1: / :-1: [feedback](https://docs.ellipsis.dev/review), review rules, user-specific overrides, `quiet` mode, and [more](https://docs.ellipsis.dev/config).

    llm_output = self.run(prompt, temperature, max_tokens=max_tokens)
    if "{" in llm_output:
        first_bracket = llm_output.index("{")
        json_output = llm_output[first_bracket:].strip("```").strip(" ")


Backticks are used in strip method, which is incorrect. Use regular quotes instead.

Suggested change
json_output = llm_output[first_bracket:].strip("```").strip(" ")
json_output = llm_output[first_bracket:].strip("```").strip(" ")
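As an aside on the comment above: in Python, str.strip treats its argument as a set of characters rather than a literal substring, so strip("```") is syntactically valid and behaves identically to strip("`") — it removes any run of backticks from both ends of the string. A quick check:

```python
fenced = '```json\n{"x": 1}\n```'

# str.strip treats its argument as a *set* of characters,
# so "```" behaves exactly like "`": it strips runs of
# backticks from both ends of the string.
print(fenced.strip("```") == fenced.strip("`"))  # True
print(repr(fenced.strip("`")))
```

So the proposed change is about style rather than correctness; the original line already strips the backticks it targets.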

    elif COHERE_AVAILABLE and isinstance(self.client, CohereClient):
        prompt = f"{prompt['system_prompt']}\n{prompt['user_prompt']}"
-       response = self.client.generate(model="command", prompt=prompt, temperature=temperature, max_tokens=max_tokens)  # type: ignore
+       response = self.client.generate(model="command", prompt=prompt, temperature=temperature, max_tokens=1024)  # type: ignore


The max_tokens parameter is hardcoded to 1024. Consider using the max_tokens argument instead.

Suggested change
-   response = self.client.generate(model="command", prompt=prompt, temperature=temperature, max_tokens=1024)  # type: ignore
+   response = self.client.generate(model="command", prompt=prompt, temperature=temperature, max_tokens=max_tokens)  # type: ignore
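One way to address the reviewer's point while still providing a sensible default is to fall back to a constant only when no value is passed. This is a sketch under assumptions, not the repository's actual code: the wrapper name cohere_generate and the DEFAULT_MAX_TOKENS constant are hypothetical, and the response shape follows the legacy Cohere generate API.

```python
from typing import Optional

DEFAULT_MAX_TOKENS = 1024  # assumed default, mirroring the hardcoded value

def cohere_generate(client, prompt: str, temperature: float,
                    max_tokens: Optional[int] = None) -> str:
    """Hypothetical wrapper: use the caller's max_tokens when given,
    otherwise fall back to a named default instead of hardcoding."""
    response = client.generate(
        model="command",
        prompt=prompt,
        temperature=temperature,
        max_tokens=max_tokens or DEFAULT_MAX_TOKENS,
    )
    return response.generations[0].text
```

Keeping the default in one named constant means callers who care can override it, while the common path behaves exactly like the merged max_tokens=1024 version.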
