Align MSTest v2→v3 eval assertions/rubric with fixture semantics#550
Conversation
Agent-Logs-Url: https://github.com/dotnet/skills/sessions/055e6526-4b7d-4457-9004-2c8c99969cd1 Co-authored-by: Evangelink <11340282+Evangelink@users.noreply.github.com>
|
/evaluate |
There was a problem hiding this comment.
Pull request overview
This PR updates the MSTest v2→v3 evaluation scenario to better align the scenario’s assertions/rubric with what the fixture (fixtures/v2-nuget/UserServiceTests.cs) actually exercises, focusing on Assert.AreEqual/Assert.AreSame object-overload removal and generic type parameter migration guidance.
Changes:
- Added an
output_matchesassertion intended to gate on guidance aboutAssert.AreEqual/Assert.AreSameoverload removal and generic type parameter migration. - Updated the scenario rubric wording from
AreNotEqualtoAreSameand made it fixture-specific.
Show a summary per file
| File | Description |
|---|---|
| tests/dotnet-test/migrate-mstest-v1v2-to-v3/eval.yaml | Tightens Goal 2 scenario assertions and adjusts rubric wording to match the v2 NuGet fixture semantics. |
Copilot's findings
Tip
Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
- Files reviewed: 1/1 changed files
- Comments generated: 1
Skill Validation Results
[1] Model: claude-opus-4.6 | Judge: claude-opus-4.6 🔍 Full Results - additional metrics and failure investigation steps ▶ Sessions Visualisation -- interactive replay of all evaluation sessions |
Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
Skill Coverage Report
Uncovered:
|
There was a problem hiding this comment.
Pull request overview
This PR tightens the MSTest v2→v3 migration eval scenario so its automated assertions and rubric align with what the fixture (fixtures/v2-nuget/UserServiceTests.cs) actually demonstrates (AreEqual/AreSame object-overload migration + generic type parameter guidance), avoiding checks for unrelated assertions.
Changes:
- Added
output_matchesassertions requiring the agent output to mentionAssert.AreEqual/Assert.AreSameand to discuss object-overload removal / generic type parameter migration. - Updated the scenario rubric item to reference
Assert.AreEqual/AreSame(instead ofAreNotEqual) and tie it toUserServiceTests.cs.
Show a summary per file
| File | Description |
|---|---|
| tests/dotnet-test/migrate-mstest-v1v2-to-v3/eval.yaml | Tightens Goal 2 assertions to require AreEqual/AreSame + overload-removal/generic-parameter guidance and updates rubric wording to match the fixture. |
Copilot's findings
Tip
Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
- Files reviewed: 1/1 changed files
- Comments generated: 0
|
/evaluate |
Skill Validation Results
[1] Model: claude-opus-4.6 | Judge: claude-opus-4.6 🔍 Full Results - additional metrics and failure investigation steps ▶ Sessions Visualisation -- interactive replay of all evaluation sessions |
…net#550) * Initial plan * Align MSTest v2->v3 eval rubric with fixture assertions Agent-Logs-Url: https://github.com/dotnet/skills/sessions/055e6526-4b7d-4457-9004-2c8c99969cd1 Co-authored-by: Evangelink <11340282+Evangelink@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: Evangelink <11340282+Evangelink@users.noreply.github.com>
This PR addresses the unresolved review thread on
tests/dotnet-test/migrate-mstest-v1v2-to-v3/eval.yamlby making the v2→v3 migration scenario consistent with what the fixture actually exercises. The goal is to avoid evaluating forAreNotEqualwhen the fixture demonstratesAreEqual/AreSameoverload migration concerns.Scenario assertion tightening (Goal 2: v2 NuGet → v3)
output_matchesregex that explicitly checks for guidance onAssert.AreEqual/Assert.AreSameobject-overload removal and generic type parameter migration language.Rubric/fixture alignment
Assert.AreEqual/AreNotEqualtoAssert.AreEqual/AreSame, matchingfixtures/v2-nuget/UserServiceTests.cs.