logoalt Hacker News

_boffin_yesterday at 11:54 PM5 repliesview on HN

The thing that I keep thinking about is the accounting / charging when it downgrades automatically.

Do they adjust the price of the api request so that only the tokens that were utilized by fable get charged at that price and the remaining tokens that the cheaper / nerfed (fable) model utilizes get charged at that price?

If the answer is no, could that be construed as fraud?


Replies

CGamesPlaytoday at 1:18 AM

The announcement elucidated this, and it's IMO worse than this. They don't downgrade to a cheaper model ([edit] for certain classes of offense they suspect you of). They sabotage the model's outputs in other, undisclosed, ways (specifically, "prompt modification, steering vectors, or parameter-efficient fine-tuning"). So, for example, they might load in a steering vector that just forgets the API to PyTorch. But it isn't just "we redirected you to a cheaper model!"

show 2 replies
tfirsttoday at 12:27 AM

Their goal is to downgrade people who are violating their TOS, so I think they'd have some argument there. I have no idea how they'll deal with inevitable false positives, especially given how oversensitive most of the other triggers are.

show 6 replies
robrenaudtoday at 12:16 AM

They use a lightweight adapter to silently degrade the performance. Usually these adaptors are made to improve the performance for a given domain/task.

garciasntoday at 12:32 AM

It royally pissed me off today by just continuing with credits without stopping to ask me if I was ok with it.

Ran up $30 in extra charges while it was just flashing on the screen that it was doing that after I walked away to do something while it was humming along.

It has always just told me I ran out of usage and had to wait before. Now? You’re just gonna pay extra because you left it unattended as you’ve done for the last year of use.

show 3 replies
golem14today at 7:05 AM

If the answer is yes, can you figure out when the switched models by looking at the itemized bill?