I've been using Helicone for a while, and given the content/context I'm sure these queries are cache hits. Here's the curl request:
curl -X "POST" "https://oai.hconeai.com/v1/embeddings" \
-H 'Content-Type: application/json' \
-H 'Authorization: Bearer sk-XXX' \
-H 'Openai-Organization: org-XXX' \
-H 'Helicone-Auth: Bearer sk-XXX' \
-H 'Helicone-Cache-Enabled: true' \
-d $'{
"model": "text-embedding-ada-002",
"input": "Can I drink alcohol while pregnant?"
}'
If I set Helicone-Cache-Enabled to false, the request returns as expected.
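In case it helps with reproducing, here is a rough sketch of how the two cases could be compared side by side. It just wraps the request above in a function that takes the cache header value as an argument. The function name embed_request, the environment variable names (OPENAI_API_KEY, OPENAI_ORG, HELICONE_API_KEY), the CURL override, and the output file names are my own placeholders, not anything Helicone defines.

```shell
#!/usr/bin/env bash
# Hypothetical repro helper (names are placeholders, not part of Helicone).
# Sends the same embeddings request as above, with Helicone-Cache-Enabled
# set to the value passed as the first argument ("true" or "false").
# CURL can be overridden (e.g. CURL=echo) to print the arguments instead of
# actually sending the request.
CURL="${CURL:-curl}"

embed_request() {
  cache_enabled="$1"
  "$CURL" -sS -X POST "https://oai.hconeai.com/v1/embeddings" \
    -H 'Content-Type: application/json' \
    -H "Authorization: Bearer ${OPENAI_API_KEY:-sk-XXX}" \
    -H "Openai-Organization: ${OPENAI_ORG:-org-XXX}" \
    -H "Helicone-Auth: Bearer ${HELICONE_API_KEY:-sk-XXX}" \
    -H "Helicone-Cache-Enabled: ${cache_enabled}" \
    -d '{"model": "text-embedding-ada-002", "input": "Can I drink alcohol while pregnant?"}'
}

# Example usage (needs real keys exported in the environment):
#   embed_request true  > cached.json
#   embed_request false > uncached.json
#   diff cached.json uncached.json
```

Running it once with true and once with false and diffing the saved responses should make it easier to see exactly where the cached response differs.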