Add math test by yiranwu0 · Pull Request #555 · microsoft/autogen

yiranwu0 · 2023-11-05T04:11:58Z

Why are these changes needed?

Add a contrib test for the math user proxy agent.
Remove the [mathchat] dependency from the build test workflow.
Put the tests for different contrib agents in one job. A failure in an earlier test does not block the tests that follow.

Example where an earlier failure does not block the subsequent tests:
https://github.com/microsoft/autogen/actions/runs/6759692725/job/18372611806
(Run "Install packages and dependencies for RetrieveChat" -> fail
Run "Install packages and dependencies for MathChat" -> success
Run "test MathChat" -> success
The overall result is still an error.)

When a dependency installation step fails, the corresponding test is skipped:
https://github.com/microsoft/autogen/actions/runs/6759701280/job/18372633270?pr=555
(Run "Install packages and dependencies for RetrieveChat" -> fail
Run "Install packages and dependencies for MathChat" -> fail
"test MathChat" is skipped)
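
The step behavior described in the two runs above can be modeled in a few lines of Python. This is only a sketch of the intended semantics, not the workflow file itself: `AgentSteps` and `run_job` are hypothetical names, and the install and test steps are stand-in callables that return True on success.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class AgentSteps:
    """Install and test steps for one contrib agent (hypothetical structure)."""

    name: str
    install: Callable[[], bool]  # returns True on success
    test: Callable[[], bool]  # returns True on success


def run_job(agents: List[AgentSteps]) -> Dict[str, str]:
    """Run every agent's steps in order and record an outcome per step."""
    outcomes: Dict[str, str] = {}
    for agent in agents:
        installed = agent.install()
        outcomes[f"install {agent.name}"] = "success" if installed else "failure"
        if not installed:
            # A failed install disables only this agent's test;
            # the loop then moves on to the next agent.
            outcomes[f"test {agent.name}"] = "skipped"
            continue
        outcomes[f"test {agent.name}"] = "success" if agent.test() else "failure"
    # The job as a whole is reported as failed if any step failed.
    outcomes["job"] = "failure" if "failure" in outcomes.values() else "success"
    return outcomes
```

With a failing RetrieveChat install and a working MathChat install, `run_job` reports the MathChat test as run and the job as failed, which mirrors the first linked run.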

Related issue number

Checks

I've included any doc changes needed for https://microsoft.github.io/autogen/. See https://microsoft.github.io/autogen/docs/Contribute#documentation to build and test documentation locally.
I've added tests (if relevant) corresponding to the changes introduced in this PR.
I've made sure all auto checks have passed.

codecov-commenter · 2023-11-05T04:13:15Z

Codecov Report

Merging #555 (9735537) into main (fda7a39) will increase coverage by 4.88%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##             main     #555      +/-   ##
==========================================
+ Coverage   32.40%   37.29%   +4.88%     
==========================================
  Files          27       27              
  Lines        3357     3357              
  Branches      756      756              
==========================================
+ Hits         1088     1252     +164     
+ Misses       2173     1989     -184     
- Partials       96      116      +20

Flag	Coverage Δ
unittests	`37.23% <ø> (+4.88%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

see 3 files with indirect coverage changes

thinkall · 2023-11-05T09:06:58Z

+          AZURE_OPENAI_API_BASE: ${{ secrets.AZURE_OPENAI_API_BASE }}
+          OAI_CONFIG_LIST: ${{ secrets.OAI_CONFIG_LIST }}
+        run: |
+          coverage run -a -m pytest test/agentchat/contrib/test_math_user_proxy_agent.py


Looks like the output of coverage run will overwrite the output of the former step. In the end, only one step of coverage will be uploaded.

Could we add different names to the output of different steps and combine them at the upload step?

`coverage run -a` appends its data to the same file (.coverage), so this should be fine. I'm not completely sure, but this is what I found.
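
For comparison, the alternative raised above (a separately named data file per step, combined before upload) could look roughly like the sketch below. It relies on the `COVERAGE_FILE` environment variable and the `coverage combine` command from coverage.py; the helper names `coverage_step`, `combine_step`, and `run_all` are hypothetical, and the step name used in the usage note is only an illustration.

```python
import os
import subprocess
import sys
from typing import Dict, List, Tuple


def coverage_step(name: str, targets: List[str]) -> Tuple[Dict[str, str], List[str]]:
    """Build the environment and command for one test step.

    Each step writes to its own data file (.coverage.<name>), so later
    steps cannot overwrite earlier results.
    """
    env = {"COVERAGE_FILE": f".coverage.{name}"}
    argv = [sys.executable, "-m", "coverage", "run", "-m", "pytest", *targets]
    return env, argv


def combine_step() -> List[str]:
    """Build the command that merges all .coverage.* files into .coverage."""
    return [sys.executable, "-m", "coverage", "combine"]


def run_all(steps: Dict[str, List[str]]) -> None:
    """Run every step, then combine the per-step data files."""
    for name, targets in steps.items():
        env, argv = coverage_step(name, targets)
        # check=False: a failing test step should not prevent later steps.
        subprocess.run(argv, env={**os.environ, **env}, check=False)
    subprocess.run(combine_step(), check=True)
```

For example, `run_all({"mathchat": ["test/agentchat/contrib/test_math_user_proxy_agent.py"]})` would write `.coverage.mathchat` and then merge it into `.coverage`. Appending with `-a` reaches the same end state with less machinery when all steps run in one working directory.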

sonichi · 2023-11-05T13:51:44Z


 jobs:
-  RetrieveChatTest:
+  OpenAI4ContribTests:


One problem with this approach is that previously installed dependencies remain in place for the later steps, so the test environment is not clean for them. Is it possible to reset the environment for later steps?

I haven't found a good way to clean up the environment. One possible solution is to use a virtual env, but each OS needs different syntax, which makes it complex. The other option is to put the tests in different jobs.

I guess there are two problems to consider here:

Is it OK NOT to have a clean env? Could there be a case where a user runs pip install autogen[retrievechat, mathchat] and expects things to just work?

I suggested that separate jobs would be tedious to look through in GitHub Actions, which is why I changed this in the first place. But we might need to revisit the question: do you think this is a real issue, or can we live with it?

I think it's pretty important for the automatic jobs to test different contributed agents in different environments, each with just its own dependencies, whether this is done through separate virtual envs, different jobs, or some other mechanism. Wish I had more experience in setting up such tests.
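
One way to sidestep the OS-specific activation syntax mentioned earlier in this thread is to skip activation and call the environment's interpreter directly, using the standard-library `venv` module. The sketch below is an assumption about how that could be done, not something this PR implements: `venv_python` and `run_in_clean_env` are hypothetical helpers, and installing `pytest` and `coverage` alongside the extra is an assumption about what the tests need.

```python
import os
import subprocess
import venv
from pathlib import Path
from typing import List


def venv_python(env_dir: Path) -> Path:
    """Return the interpreter path inside a virtual environment."""
    if os.name == "nt":
        # Windows places executables under Scripts\
        return env_dir / "Scripts" / "python.exe"
    # POSIX systems (Linux, macOS) place them under bin/
    return env_dir / "bin" / "python"


def run_in_clean_env(env_dir: Path, extra: str, test_paths: List[str]) -> int:
    """Create a fresh environment, install one extra, and run its tests."""
    # clear=True wipes any previous contents, so each agent starts clean.
    venv.create(env_dir, with_pip=True, clear=True)
    python = str(venv_python(env_dir))
    subprocess.run(
        [python, "-m", "pip", "install", f".[{extra}]", "pytest", "coverage"],
        check=True,
    )
    result = subprocess.run(
        [python, "-m", "coverage", "run", "-a", "-m", "pytest", *test_paths]
    )
    return result.returncode
```

A call such as `run_in_clean_env(Path(".venv-mathchat"), "mathchat", ["test/agentchat/contrib/test_math_user_proxy_agent.py"])` would then test MathChat with only its own dependencies. The cost is a full reinstall per agent, which separate jobs would also incur.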

coverage run -a -m pytest test/test_retrieve_utils.py test/agentchat/contrib

will test all files in the contrib folder, while only the dependencies for RetrieveChat are installed. This will cause errors in the future if a contrib agent requires other packages to be installed.
https://github.com/microsoft/autogen/blob/fda7a39dd9ede1c115d37c2a454f55b7f22c8193/.github/workflows/contrib-tests.yml#L52C38-L52C38
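
A rough illustration of one guard against this: select only the test files whose required modules are importable in the current environment. The helper names and the mapping passed to them are hypothetical; which modules each contrib test actually needs is not stated here and would have to be filled in.

```python
from importlib.util import find_spec
from typing import Dict, List


def missing_modules(modules: List[str]) -> List[str]:
    """Return the top-level module names that cannot be imported."""
    return [name for name in modules if find_spec(name) is None]


def runnable_tests(requirements: Dict[str, List[str]]) -> List[str]:
    """Return the test files whose required modules are all installed.

    `requirements` maps a test file path to the top-level modules it
    needs (a hypothetical mapping maintained by hand).
    """
    return [
        path
        for path, modules in requirements.items()
        if not missing_modules(modules)
    ]
```

The resulting list can be passed to pytest in place of the whole folder. Inside a test module, `pytest.importorskip` offers a similar guard by skipping the module when an import fails.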

update

d39e293

yiranwu0 had a problem deploying to openai1 November 5, 2023 04:12 — with GitHub Actions Failure

update

84c303c

yiranwu0 had a problem deploying to openai1 November 5, 2023 04:20 — with GitHub Actions Failure

update

a11831a

yiranwu0 had a problem deploying to openai1 November 5, 2023 04:33 — with GitHub Actions Failure

update

b759541

yiranwu0 had a problem deploying to openai1 November 5, 2023 04:49 — with GitHub Actions Failure

update

6e34f78

yiranwu0 had a problem deploying to openai1 November 5, 2023 04:55 — with GitHub Actions Failure

update

69b253a

yiranwu0 had a problem deploying to openai1 November 5, 2023 05:35 — with GitHub Actions Failure

yiranwu0 had a problem deploying to openai1 November 5, 2023 06:09 — with GitHub Actions Failure

update

3210eee

yiranwu0 had a problem deploying to openai1 November 5, 2023 06:26 — with GitHub Actions Failure

yiranwu0 had a problem deploying to openai1 November 5, 2023 06:27 — with GitHub Actions Failure

check when dependency fail

86ae46d

yiranwu0 had a problem deploying to openai1 November 5, 2023 06:28 — with GitHub Actions Failure

revision

189a4d3

yiranwu0 had a problem deploying to openai1 November 5, 2023 06:35 — with GitHub Actions Failure

thinkall reviewed Nov 5, 2023

sonichi reviewed Nov 5, 2023

revise

c0dfd3d

yiranwu0 had a problem deploying to openai1 November 5, 2023 16:36 — with GitHub Actions Failure

Merge branch 'main' into mathtest

9396960

yiranwu0 had a problem deploying to openai1 November 7, 2023 19:14 — with GitHub Actions Failure

Merge remote-tracking branch 'origin/main' into mathtest

9735537

yiranwu0 mentioned this pull request Nov 25, 2023

[Core] Compression in GroupChat #497

Closed

3 tasks

yiranwu0 wants to merge 18 commits into main from mathtest

Comments, in page order:
yiranwu0 (Nov 5, 2023, edited)
codecov-commenter (Nov 5, 2023, edited)
thinkall (Nov 5, 2023)
thinkall (Nov 5, 2023)
yiranwu0 (Nov 5, 2023)
sonichi (Nov 5, 2023)
yiranwu0 (Nov 5, 2023, edited)
rickyloynd-microsoft (Nov 5, 2023)
yiranwu0 (Nov 6, 2023, edited)

5 participants
