Add TokenCount metric by pantonante · Pull Request #74 · relari-ai/continuous-eval

pantonante · 2024-08-03T03:29:14Z

🚀 This description was created by Ellipsis for commit `dd610a6`

Summary:

Added TokenCount metric to count tokens in retrieved context using tiktoken encoder or an approximation, with tests and documentation.

Key points:

Added TokenCount metric in continuous_eval/metrics/retrieval/tokens.py.
Updated continuous_eval/metrics/retrieval/__init__.py to include TokenCount.
Implemented TokenCount class to count tokens using tiktoken encoder or an approximation.
Added tests for TokenCount in tests/retrieval_metrics_test.py.
Added documentation for TokenCount in docs/src/content/docs/metrics/Retrieval/Deterministic/token_count.md.
Updated metrics overview in docs/src/content/docs/metrics/overview.md to include TokenCount.
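
For readers without the diff at hand, here is a minimal sketch of the behavior the summary describes: count tokens in the retrieved context with a tiktoken encoder, or fall back to a character-based approximation. It is not the code merged in this PR. The class name `TokenCountSketch`, the `"approx"` selector, and the `num_tokens` output key are assumptions made for illustration. Only the `_CHARACTERS_PER_TOKEN = 4.0` constant comes from the diff.

```python
import math
from typing import Dict, List

# Constant shown in the PR diff; everything else below is illustrative.
_CHARACTERS_PER_TOKEN = 4.0


class TokenCountSketch:
    """Hypothetical stand-in for the TokenCount metric (not the merged code)."""

    def __init__(self, encoder_name: str = "approx") -> None:
        # "approx" is an assumed selector for the character-based fallback.
        self._encoder = None
        if encoder_name != "approx":
            # Imported lazily so the approximation works without tiktoken installed.
            import tiktoken

            self._encoder = tiktoken.get_encoding(encoder_name)

    def _count(self, text: str) -> int:
        if self._encoder is not None:
            # Exact count from the chosen tiktoken encoding.
            return len(self._encoder.encode(text))
        # Approximation: roughly one token per _CHARACTERS_PER_TOKEN characters.
        return math.ceil(len(text) / _CHARACTERS_PER_TOKEN)

    def __call__(self, retrieved_context: List[str]) -> Dict[str, int]:
        # Sum the counts over every retrieved chunk.
        return {"num_tokens": sum(self._count(chunk) for chunk in retrieved_context)}
```

With the approximation, `TokenCountSketch()(["abcdefgh", "abc"])` counts 2 tokens for the first chunk and 1 for the second. Passing a tiktoken encoding name such as `"cl100k_base"` would use the real encoder instead, provided tiktoken is installed.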

Generated with ❤️ by ellipsis.dev

ellipsis-dev

❌ Changes requested. Reviewed everything up to abedd8b in 36 seconds

More details

Looked at 88 lines of code in 3 files
Skipped 0 files when reviewing.
Skipped posting 0 drafted comments based on config settings.

Workflow ID: wflow_arh5EbQPeJG41abu

Want Ellipsis to fix these issues? Tag @ellipsis-dev in a comment. You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

ellipsis-dev · 2024-08-03T03:29:58Z

continuous_eval/metrics/retrieval/tokens.py

+
+from continuous_eval.metrics.base import Metric
+
+_CHARACTERS_PER_TOKEN = 4.0


Consider using a more dynamic or configurable approach for _CHARACTERS_PER_TOKEN instead of a fixed value to better accommodate different languages or text formats.
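
One way to act on this suggestion is to keep 4.0 as the default but let callers override the ratio. The sketch below is an assumption about how that could look, not a change made in this PR. The function name `approx_token_count` and its `characters_per_token` parameter are hypothetical.

```python
import math

# Default taken from the constant in the reviewed diff.
DEFAULT_CHARACTERS_PER_TOKEN = 4.0


def approx_token_count(
    text: str, characters_per_token: float = DEFAULT_CHARACTERS_PER_TOKEN
) -> int:
    """Estimate the token count from text length with a caller-supplied ratio."""
    if characters_per_token <= 0:
        raise ValueError("characters_per_token must be positive")
    # Round up so any non-empty text counts as at least one token.
    return math.ceil(len(text) / characters_per_token)
```

A caller working with a language or format that tokenizes more densely could pass a smaller ratio, for example `approx_token_count(text, characters_per_token=2.0)`. Any such ratio would need to be measured against a real tokenizer.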

ellipsis-dev

👍 Looks good to me! Incremental review on d39208a in 16 seconds

More details

Looked at 72 lines of code in 2 files
Skipped 0 files when reviewing.
Skipped posting 1 drafted comment based on config settings.

1. docs/src/content/docs/metrics/overview.md:96

Draft comment:
The documentation for the TokenCount metric is accurate and aligns with the detailed documentation in token_count.md. It correctly lists the metric under the Deterministic category for Retrieval metrics and provides a brief definition and the required input.
Reason this comment was not posted:
Confidence changes required: 0%
The documentation for the TokenCount metric in the overview.md file is accurate and aligns with the detailed documentation in token_count.md. It correctly lists the metric under the Deterministic category for Retrieval metrics and provides a brief definition and the required input. No issues or inconsistencies are found in this documentation snippet.

Workflow ID: wflow_OHoTzzysnHKRafsY

ellipsis-dev

👍 Looks good to me! Incremental review on dd610a6 in 34 seconds

More details

Looked at 86 lines of code in 2 files
Skipped 0 files when reviewing.
Skipped posting 1 drafted comment based on config settings.

1. README.md:18

Draft comment:
The PR description mentions the addition of the TokenCount metric, but the diff does not include changes to any Python files such as continuous_eval/metrics/retrieval/tokens.py. Please ensure that all relevant changes are included in the PR.
Reason this comment was not posted:
Confidence of 0% on close inspection, compared to threshold of 50%.

Workflow ID: wflow_dUTdZ3I9ja94hUpX

Add TokenCount metric

abedd8b

pantonante requested a review from yisz August 3, 2024 03:29

ellipsis-dev bot reviewed Aug 3, 2024

yisz approved these changes Aug 3, 2024

token count docs

d39208a

ellipsis-dev bot reviewed Aug 4, 2024

update readme + bump version

dd610a6

ellipsis-dev bot reviewed Aug 4, 2024

yisz merged commit a9823db into main Aug 4, 2024

yisz merged 3 commits into main from metric/token_count


2 participants

