Helicone Community Page

Updated 4 days ago

Handling Concurrent LLM Requests: Caching and Processing Considerations

What happens when I send the same LLM request before the initial one has finished processing? Does Helicone cache and return the initial promise for the second request, or are both requests processed separately?
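
For context, here is a minimal sketch of the scenario being asked about: two identical requests fired through Helicone's OpenAI proxy before the first resolves. It assumes Helicone's documented `Helicone-Cache-Enabled` request header; the model name and the `helicone-cache` response-header check are illustrative, not a confirmed description of how in-flight duplicates are handled.

```ts
import OpenAI from "openai";

// Route requests through Helicone's OpenAI proxy with caching enabled.
const client = new OpenAI({
  baseURL: "https://oai.helicone.ai/v1",
  apiKey: process.env.OPENAI_API_KEY,
  defaultHeaders: {
    "Helicone-Auth": `Bearer ${process.env.HELICONE_API_KEY}`,
    "Helicone-Cache-Enabled": "true",
  },
});

async function main() {
  const params: OpenAI.Chat.ChatCompletionCreateParamsNonStreaming = {
    model: "gpt-4o-mini",
    messages: [
      { role: "user", content: "Explain LLM response caching in one sentence." },
    ],
  };

  // Fire both requests concurrently, before either one resolves.
  const [first, second] = await Promise.all([
    client.chat.completions.create(params).withResponse(),
    client.chat.completions.create(params).withResponse(),
  ]);

  // If Helicone deduplicated the in-flight request, the second call should
  // report a cache hit; two misses would mean both were processed separately.
  console.log("first:", first.response.headers.get("helicone-cache"));
  console.log("second:", second.response.headers.get("helicone-cache"));
}

main().catch(console.error);
```

A run like this would show empirically which behavior applies: whether the proxy coalesces duplicate requests that are still in flight, or only serves cache hits for responses that have already completed.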