# 06-technical-discussion
b
hey all - I'm getting a really weird bug from OpenAI. when I prompt it with something like "please summarize this text" I get repeating words like this: "I'm proficient in Python, SQL, and machine/deep learning, and I'm passionate, and I'm a quick, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and, and," any ideas?
d
Any chance you're using Semantic-Kernel? There was a bug in earlier versions that could cause issues like that. If you're not using Semantic-Kernel, I think the root cause was passing an invalid model identifier (e.g. something other than gpt-3.5-turbo) to OpenAI in your API calls
b
this is coming straight out of the OpenAI api so no Semantic Kernel
d
Right but could (possibly) have the same root cause of a bad model specifier being sent to OpenAI by the caller
b
got it. so basically just a typo or something when setting the model?
d
My guess in the situation I encountered was OpenAI was somehow defaulting to a crappy ancient internal test model (there are around 100? models that can be specified by the API).
I can't guarantee it's a bad model specifier, but I was definitely getting garbage that looked like that.
b
hm looks good to me
d
yeah, that does look sensible
does gpt-4 behave better?
b
i dont think so
there are so many summary products out there. we have to be doing something wrong haha
d
I haven't played with contexts that can overflow the working memory of the GPU. Is it possible you're sending it too big a document (hopefully they return a sensible error but since I've never done that I don't know personally how they handle it)
b
we're sending emails over so honestly not crazy
d
^ pure guess on that last one, just trying to be helpful but probly not succeeding 🙂
b
yea haha
appreciate the help
better than banging my head against this
do you know if OpenAI has support?
d
My guess is Azure's version is more likely to have support than OpenAI (same models and APIs, just supported by a different org and billed through a different company)
b
ah
d
(I've also heard the Azure variant has significantly faster response times, but I haven't confirmed that myself)
b
we're on AWS
d
The Azure folks probably won't mind an opportunity to try to convince you of their goodness lol
b
yea haha
d
OpenAI might have support as well, they're just a newer and smaller company and good support orgs tend to take time to build
Anyway I'm well out of my domain of actual knowledge here
b
yea - im gonna bug my investors to get in front of openai and see if that works lol
OpenAI discord was useless
v
This is a hallucination error, modify the prompt to let it know the tone that it needs to use or use CoT prompting. For eg: Summarize the text such that it can be used in my LinkedIn profile, and give reasons for why.
Try this, and let me know if it works out! 🙂
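A minimal sketch of what that suggestion could look like in code, assuming a chat-style message list (the wording and helper name here are illustrative, not an exact recipe):

```python
# Hypothetical helper building a CoT-style summarization prompt, as suggested
# above: state the target tone/use and ask the model to explain its choices.
def build_cot_summary_prompt(text: str) -> list[dict]:
    """Return a chat-style message list asking for a summary plus reasoning."""
    return [
        {
            "role": "system",
            "content": "You are a concise assistant that summarizes text.",
        },
        {
            "role": "user",
            "content": (
                "Summarize the following text so it can be used in my "
                "LinkedIn profile, and explain step by step why you chose "
                f"each point:\n\n{text}"
            ),
        },
    ]

messages = build_cot_summary_prompt("I'm proficient in Python and SQL...")
```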
b
whats CoT?
v
Chain of thought prompting
b
ah
we've tried a lot of stuff like "please do not repeat words over and over again" lol
v
Can you try what I suggested and let me know?
b
yea will have to wait for tomorrow
lead eng is out. im the designer lol
& founder
l
check your frequency penalty. also are you using chat or completions api? feel like chat has been RLHF'ed more so it's less likely to do this whereas completions is probably closer to the original models but tends to fall into pits like this
i don't think it's a hallucination problem because this isn't an issue of truthfulness, and i'm pretty sure the api would return an error message if you did manage to exceed the context window (like, just ML-wise, transformers on their own just don't take any wrong sized input)
ok yeah found your post in the discord. i think it's 100% a frequency and presence penalty bug. these penalties REDUCE the likelihood of words being repeated (frequency penalizes more common words more heavily, whereas presence is a flat penalty for any word that appears at least once). your penalty is negative, which means you're encouraging it to repeat itself. so, to fix: just change both to some number that's not negative. 0 is the default for both in openai playground and that works fine for me generally
🔥 1
(edit: lol playground doesn't even let you use negative penalties)
(also, as a side note, if you want text summaries, wouldn't you want temperature as low as possible ie. 0 to get maximum factuality and minimize creativity?)
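A minimal sketch of the fix described above, just assembling the request parameters rather than actually calling the API (the model name and prompt text are placeholders; `frequency_penalty`, `presence_penalty`, and `temperature` are real OpenAI chat-completions parameters):

```python
# Sketch of the suggested parameter fix: non-negative penalties so repeated
# tokens are discouraged rather than encouraged, and temperature 0 for more
# deterministic summaries. No network call is made; this only builds a payload.
def build_summary_request(text: str) -> dict:
    return {
        "model": "gpt-3.5-turbo",  # placeholder model name
        "messages": [
            {"role": "user", "content": f"Please summarize this text:\n\n{text}"},
        ],
        "frequency_penalty": 0,  # default; negative values ENCOURAGE repetition
        "presence_penalty": 0,   # default; flat penalty for repeated tokens
        "temperature": 0,        # minimize sampling randomness for summaries
    }

params = build_summary_request("We ship email summaries...")
```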
b
omg thank you!!
that sounds about right
we've been busy building out other pieces of infra so we have kinda left the parameters of the openai calls unchecked but this is super helpful
will fix penalty and temp and let you know how it goes
🫡 1
gg yalll did it @Leon Wu!
issue resolved
🔥 1
🧠 1
l
yesss let's goo!!!