> I need a way to compare the simulated output for the clients to the actual output of the endpoint.

How often is the comparison made? In practice, how do you achieve this in a big org? My team has services that talk to other services: some owned by my team, some owned by other teams in the org, and some owned by third parties.