tracetest: [EPIC][Error Handling] Test Run Page Error Handling Improvements
While reviewing different user sessions, the team has identified multiple areas of opportunity in the error handling messaging shown when executing a test run.
Currently, a test run goes through three significant steps:
- Trigger execution
- Trace fetching
- Test spec execution
Each step has its own set of success and failure scenarios that need to be appropriately displayed to the user. Today, Tracetest uses only two fields from the test run to validate possible errors.
- `lastErrorState`, which contains the string info for the last known error.
- `state`, which controls the status of the test.
This was a good starting point, but it is no longer sufficient for the clients (CLI/UI) to display enough information for the user to understand how to fix potential problems, nor to provide good feedback on what the server side is executing at any given time.
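For illustration, here is a minimal sketch of what a client can rely on today, assuming TypeScript-style typings on the client side. Only the `state` and `lastErrorState` fields come from this issue; the shape and helper below are illustrative, not the actual Tracetest types.

```typescript
// Minimal sketch of what a client can read from a test run today.
// Only `state` and `lastErrorState` come from this issue; the helper is illustrative.
interface TestRun {
  state: string;            // controls the status of the test
  lastErrorState?: string;  // string info for the last known error, if any
}

// With only these two fields the page can say *that* a run failed,
// but not *where* in the trigger -> trace fetch -> test spec pipeline it failed,
// or what the server is executing right now.
function describeRun(run: TestRun): string {
  return run.lastErrorState
    ? `Run failed: ${run.lastErrorState}`
    : `Run is ${run.state}`;
}
```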
Given this, we have identified a matrix of possible scenarios based on the test run state and its results, together with what we should display to the user in each case.
Test Run Flow Chart
flowchart TD
A[Run] --> B[Created]
B --> C[Resolve Trigger Vars]
C --> D[Execute Trigger]
D --> ET{Is Successful Trigger}
ET -->|Yes| E[Queue Polling]
ET -->|No| ES[Set State to Failed]
ES --> Q
E --> F[Execute Polling Job]
F --> G[Fetch Trace from Data Store]
G --> H{Trace Exists}
H -->|No| I{timed out config reached}
H -->|Yes| J{Has the span # changed}
J -->|Yes| G
J -->|No| K[Trace is ready]
I -->|No| G
I -->|Yes| L[Trace fetch failed]
K --> O[Generating Outputs]
O --> P[Running Test Specs]
P --> Q[Finish]
L --> ES
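To make the polling branch of the chart concrete, here is a hedged TypeScript sketch of that loop. The trace shape, config names, and function are assumptions for illustration, not the actual Tracetest implementation.

```typescript
// Illustrative sketch of the polling loop from the flow chart above.
// The trace shape and config names are assumptions, not actual Tracetest types.
interface Trace {
  spans: unknown[];
}

interface PollingConfig {
  maxIterations: number; // the "timed out config reached" bound
  periodMs: number;      // wait between polling iterations
}

async function pollTrace(
  fetchTrace: () => Promise<Trace | null>,
  config: PollingConfig,
): Promise<Trace> {
  let previousSpanCount = -1;

  for (let iteration = 1; iteration <= config.maxIterations; iteration++) {
    const trace = await fetchTrace();

    if (trace !== null) {
      // "Has the span # changed?" - stop once the trace stops growing.
      if (trace.spans.length === previousSpanCount) {
        return trace; // trace is ready
      }
      previousSpanCount = trace.spans.length;
    }

    await new Promise(resolve => setTimeout(resolve, config.periodMs));
  }

  // Timeout config reached without a stable trace: the fetch failed.
  throw new Error(`Trace fetch failed after ${config.maxIterations} polling iterations`);
}
```

Each iteration, span count, and the reason for continuing are exactly the details the matrix below suggests surfacing to the user while the run is in progress.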
State Matrix for Test Runs
| | CREATED | TRIGGERING | CONNECTING_TO_DATA_STORE | POLLING_TRACE | GENERATING_OUTPUTS | RUNNING_TEST_SPECS | FINISHED |
|---|---|---|---|---|---|---|---|
| Successful | Run page | Trigger response data (body, timing, headers) | Signal of a successful connection to the data store | Trace | Outputs | Test spec results | Trigger/Trace/Test |
| Failed | Failed page | Breakdown of the trigger problem (DNS connection, queue connection, auth problems) | Breakdown of issues, similar to the test connection endpoint | Breakdown of the trace fetching, with the reason for the error | Warning that output generation failed, and the reason why | Failed test specs | Global failed state |
| In Progress | Loading state | Loading state with trigger steps | Loading state | Similar to the server output (polling iteration #, # of spans, reason for the next iteration) | Loading state | Loading state | Loading state |
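As a sketch of how a client could key its display off this matrix: the state names below come from the table header, while the outcome type and messages are illustrative only.

```typescript
// States taken from the matrix header; the outcome type and messages are illustrative.
type RunStage =
  | 'CREATED'
  | 'TRIGGERING'
  | 'CONNECTING_TO_DATA_STORE'
  | 'POLLING_TRACE'
  | 'GENERATING_OUTPUTS'
  | 'RUNNING_TEST_SPECS'
  | 'FINISHED';

type RunOutcome = 'SUCCESSFUL' | 'FAILED' | 'IN_PROGRESS';

// Turns a SNAKE_CASE stage into readable text, e.g. "polling trace".
const label = (stage: RunStage) => stage.toLowerCase().replace(/_/g, ' ');

// What the test run page could show for a given cell of the matrix.
function bannerFor(stage: RunStage, outcome: RunOutcome, reason?: string): string {
  if (outcome === 'IN_PROGRESS') {
    return `Running: ${label(stage)}...`;
  }
  if (outcome === 'FAILED') {
    // e.g. DNS/auth problems while triggering, or a polling timeout while fetching the trace
    return `Run failed during ${label(stage)}: ${reason ?? 'unknown error'}`;
  }
  return stage === 'FINISHED'
    ? 'Run finished: trigger, trace and test spec results are available'
    : `${label(stage)} completed`;
}
```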
Tickets and Tasks
Follow-up release
Nice to have
- [Error Handling] Event Log text version
- [Error Handling] Mode bar live status
Mockups
About this issue
- Original URL
- State: closed
- Created a year ago
- Comments: 15
Yes yes. This is our second-highest priority, behind only knocking out the configuration work, which I want the team to swarm on as it is blocking other activity. If we get to a spot where @jorgeepc or @xoscar do not have an area where they can contribute to the config changes, we will want to focus on this.
Hello everyone, here's my take on what should be added to the test run page to improve the user experience.
Acceptance Criteria:

AC1
As a user looking at the test run page,
And I just ran the test,
And the test failed in the initial trigger request (HTTP, gRPC, etc.),
I should be able to see a breakdown of the error and the steps that occurred prior to the error.

AC2
As a user looking at the test run page,
And I just ran a test and the initial request worked as expected,
And the app is trying to fetch the trace,
I should be able to see a description of what the app is doing in the background, things like:
- the polling iteration #
- the # of spans found so far
- the reason for the next iteration

AC3
As a user looking at the test run page,
And I just ran a test and the initial request worked as expected,
And the app failed to fetch the trace,
I should be able to see a proper error description of what happened and what was done to try to fetch the trace,
And I should be able to see the initial request/response details.
The idea is to give users easier ways to debug what is happening within the system, whether we found a problem or something else is going on. This can also help them tweak their polling settings to get the best results.
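One way to satisfy AC2/AC3 could be a small progress-event payload the page renders while the trace is being fetched. This is only a sketch under the assumption that the server emits such events; the event and field names are illustrative, not an existing Tracetest API.

```typescript
// Illustrative progress event the page could render while the trace is fetched (AC2)
// and reuse to explain a failure afterwards (AC3). Field names are assumptions.
interface TracePollingEvent {
  iteration: number;               // polling iteration #
  spanCount: number;               // # of spans seen so far
  reasonForNextIteration?: string; // e.g. "span count still changing"
  error?: string;                  // set when the trace fetch ultimately fails
}

function renderPollingStatus(event: TracePollingEvent): string {
  if (event.error) {
    return `Trace fetch failed after ${event.iteration} iterations: ${event.error}`;
  }
  const base = `Iteration ${event.iteration}: found ${event.spanCount} spans`;
  return event.reasonForNextIteration
    ? `${base}; polling again because ${event.reasonForNextIteration}`
    : base;
}
```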
CC: @olha23
Added some comments; if we are in the clear about the config stuff, I will start working on this Monday morning!