Add sql metrics by yisz · Pull Request #63 · relari-ai/continuous-eval

yisz · 2024-05-21T03:00:13Z

Added an AST comparison for SQL based on the weighted changes in diff tree.

🚀 This description was created by Ellipsis for commit `6eb410a`

Summary:

Enhanced SQL code evaluation with new metrics, updated documentation, and added tests, including specific file paths and a new dependency.

Key points:

Added SQLSyntaxMatch and SQLASTSimilarity metrics for SQL code evaluation.
Updated README.md to include new SQL metrics in the metrics table.
Refactored Python package structure to include SQL metrics.
Updated documentation for new SQL metrics.
Added tests for new SQL metrics.
Introduced SQLSyntaxMatch and SQLASTSimilarity metrics in continuous_eval/metrics/code/sql/deterministic.py.
Updated README.md and documentation under /docs/src/content/docs/metrics/Code/Deterministic/ for new SQL metrics.
Added tests for new SQL metrics in tests/code_metrics_test.py.
Updated pyproject.toml to include sqlglot as an optional dependency.

Generated with ❤️ by ellipsis.dev

… test file

ellipsis-dev

👍 Looks good to me! Reviewed everything up to 85efa36 in 56 seconds

More details

Looked at 368 lines of code in 12 files
Skipped 1 files when reviewing.
Skipped posting 1 drafted comments based on config settings.

1. README.md:125

Draft comment:
LGTM. The new SQL metrics are well integrated and documented.
Reason this comment was not posted:
Confidence changes required: 0%
The PR adds new metrics for SQL code evaluation, specifically SQLSyntaxMatch and SQLASTSimilarity. The implementation seems to correctly handle the formatting and AST comparison for SQL queries. The documentation and tests are updated accordingly to reflect these new metrics. The PR also includes necessary dependency updates in pyproject.toml for sqlparse and sqlglot, which are used in the SQL metrics. Overall, the PR appears to be well-structured and addresses the intended functionality of adding SQL metrics.

Workflow ID: wflow_c4y8tPTwiBNh7Dyg

You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

pantonante · 2024-05-21T15:03:59Z

continuous_eval/metrics/code/sql/sql_deterministic_metrics.py

I would drop the sql from the name, the file is already in the sql folder making it "obvious"

pantonante · 2024-05-21T15:05:27Z

continuous_eval/metrics/code/sql/sql_deterministic_metrics.py

@@ -0,0 +1,77 @@
+from typing import List, Union
+
+import sqlparse


Can avoid sqlparse to format the query? Can we use sqlglot only. It seems sqlglot can format but also optimize in a common format https://github.com/tobymao/sqlglot?tab=readme-ov-file#sql-optimizer

pantonante · 2024-05-21T15:07:44Z

continuous_eval/metrics/code/sql/sql_deterministic_metrics.py

+        if isinstance(change, Keep):
+            return 0
+        elif isinstance(change, Update):
+            return 1.5  # Updates are significant as they imply a modification in function or value.
+        elif isinstance(change, Insert) or isinstance(change, Remove):
+            return 1  # Inserts and Removes affect the structure and content but are simpler than updates.
+        elif isinstance(change, Move):
+            return 0.5  # Moves are generally less impactful as they simply change the order.
+        return 1  # Default weight for other types of changes


Weights should be modifiable through some configuration parameters (see

continuous-eval/continuous_eval/metrics/generation/text/deterministic.py

Lines 12 to 14 in 344f7e9

class DeterministicFaithfulnessConfig:

rouge_precision_threshold: float = 0.5

token_overlap_precision_threshold: float = 0.5

)

pantonante · 2024-05-21T15:09:01Z

pyproject.toml

+sqlparse = "^0.5.0"
+sqlglot = "^23.17.0"


I think these packages should be optional/extra

pantonante · 2024-05-21T15:10:05Z

continuous_eval/metrics/code/sql/sql_deterministic_metrics.py

+        return {"SQL_Syntax_Match": max_match_score}
+
+
+class SQLASTSimilarity(Metric):


How do these two metrics handle comments?

ellipsis-dev

👍 Looks good to me! Incremental review on da2d38b in 4 minutes and 0 seconds

More details

Looked at 329 lines of code in 8 files
Skipped 1 files when reviewing.
Skipped posting 0 drafted comments based on config settings.

Workflow ID: wflow_wXck8lQebKerzqVB

You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

ellipsis-dev

👍 Looks good to me! Incremental review on 6eb410a in 2 minutes and 54 seconds

More details

Looked at 264 lines of code in 5 files
Skipped 1 files when reviewing.
Skipped posting 0 drafted comments based on config settings.

Workflow ID: wflow_cNrQQ3jGozSZ5BIb

You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

Ubuntu and others added 11 commits May 16, 2024 18:07

Add initial SQL metrics implementation

da8a897

Add PR description for SQL metrics implementation

6b7bd38

Add documentation for SQLSyntaxMatch class and update tests

b362c54

Move SQL metrics documentation to the specified directory and add new…

5c405ba

… test file

Remove partial match test and update documentation

c0920e4

Remove test_partial_match from SQL metrics tests

53cd8b5

Delete PR_DESCRIPTION.md as per user's request

892bc2b

Implement SQL AST comparison metric using sqlglot

89a51d4

Add SQLASTSimilarity class for AST-based SQL query comparison

5167376

update sql metrics

671ac55

update docs & resolve merge conflicts

85efa36

yisz requested a review from pantonante May 21, 2024 03:00

yisz self-assigned this May 21, 2024

ellipsis-dev bot reviewed May 21, 2024

View reviewed changes

pantonante requested changes May 21, 2024

View reviewed changes

revamp sql metrics

da2d38b

yisz requested a review from pantonante May 22, 2024 16:00

ellipsis-dev bot reviewed May 22, 2024

View reviewed changes

Minor changes

6eb410a

pantonante merged commit 8124a22 into main May 22, 2024

pantonante deleted the add-sql-metrics branch May 22, 2024 20:08

ellipsis-dev bot reviewed May 22, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add sql metrics#63

Add sql metrics#63
pantonante merged 13 commits intomainfrom
add-sql-metrics

yisz commented May 21, 2024 •

edited by ellipsis-dev bot

Loading

Uh oh!

ellipsis-dev bot left a comment

Uh oh!

pantonante May 21, 2024

Uh oh!

pantonante May 21, 2024

Uh oh!

pantonante May 21, 2024 •

edited

Loading

Uh oh!

pantonante May 21, 2024

Uh oh!

pantonante May 21, 2024

Uh oh!

ellipsis-dev bot left a comment

Uh oh!

ellipsis-dev bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -0,0 +1,77 @@
		from typing import List, Union

		import sqlparse

	class DeterministicFaithfulnessConfig:
	rouge_precision_threshold: float = 0.5
	token_overlap_precision_threshold: float = 0.5

		return {"SQL_Syntax_Match": max_match_score}


		class SQLASTSimilarity(Metric):

Conversation

yisz commented May 21, 2024 • edited by ellipsis-dev bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary:

Uh oh!

ellipsis-dev bot left a comment

Choose a reason for hiding this comment

Uh oh!

pantonante May 21, 2024

Choose a reason for hiding this comment

Uh oh!

pantonante May 21, 2024

Choose a reason for hiding this comment

Uh oh!

pantonante May 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pantonante May 21, 2024

Choose a reason for hiding this comment

Uh oh!

pantonante May 21, 2024

Choose a reason for hiding this comment

Uh oh!

ellipsis-dev bot left a comment

Choose a reason for hiding this comment

Uh oh!

ellipsis-dev bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

yisz commented May 21, 2024 •

edited by ellipsis-dev bot

Loading

pantonante May 21, 2024 •

edited

Loading