Scotter

Since 2024/06/06 at 11:30pm UTC cache: true on embeddings hangs for 335-340 seconds

I've been using Helicone for awhile and I'm sure these are on queries that are cache hits given the content/context. Here's the curl request:

Plain Text

curl -X "POST" "https://oai.hconeai.com/v1/embeddings" \
     -H 'Content-Type: application/json' \
     -H 'Authorization: Bearer sk-XXX' \
     -H 'Openai-Organization: org-XXX' \
     -H 'Helicone-Auth: Bearer sk-XXX' \
     -H 'Helicone-Cache-Enabled: true' \
     -d $'{
  "model": "text-embedding-ada-002",
  "input": "Can I drink alcohol while pregnant?"
}'

If I switch cache enabled to false it returns as expected.

13 comments

SScotter

Partial chat stream chunks when using Helicone w/ Ruby's OpenAI Gem

Hey Helicone team, I wanted to bring this to your attention. Ruby dev's are finding that using Helicone as a proxy is interfering with the chat stream. https://github.com/alexrudall/ruby-openai/issues/251

11 comments

SScotter

Double check on whether Helicone API returns html for 503 errors

Recently my openai client has been erroring because of trying to JSON parse an NGINX html 503 response. I'm using Helicone and pretty sure that it's not passing a 503 html page along from OpenAI but serving it's own NGINX 503. If this is the case, I don't think you're intending to serve HTML over a JSON API

SScotter

My dashboard of requests is consistently delayed by about 48 hours

Is this intended? My account might be in a weird state since I signed up really early.

8 comments

Helicone Community Page

Since 2024/06/06 at 11:30pm UTC cache: true on embeddings hangs for 335-340 seconds

Partial chat stream chunks when using Helicone w/ Ruby's OpenAI Gem

Double check on whether Helicone API returns html for 503 errors

My dashboard of requests is consistently delayed by about 48 hours