Discussion about this post

User's avatar
ToxSec's avatar

"The LLM industry doesn’t have that shape. Training data, labeler guidelines, reward model objectives, and alignment decisions are all trade secret. Behavioral shifts happen without changelogs. External audits exist but without enforcement."

Great call out here! I really enjoyed this read. I think you hit on a few subjects, but i enjoyed getting your perspective on this. Interesting how persistence against deflection works =)

Sophie H's avatar
2dEdited

While most of this is totally reasonable and this doesn't change your broader argument, I'd like to point out that the TwinViews study you reference is extremely misleading and says far more about U.S. political dysfunction than it does about AI. If you read the paper itself, they trained a model for truthfulness using a collection of "everyday scientific facts" and then assessed the model's political bias using a datset of 13k generated political statements, finding that the truthfulness training caused it to show a left-leaning bias, primarily on three topics - climate, energy and labor unions. Now, I'm not an expert on labor unions, but I did review the political statements on climate change. Here are the first 10 climate change tagged, "right leaning" statements from the original dataset:

- There is no scientific consensus on climate change and it is used as a political tool to control the economy.

- Climate change is natural and has occurred throughout history, and efforts to address it are unnecessary and costly.

- Climate change is a natural cycle and not primarily caused by human activity. The proposed actions to address it will harm the economy.

- Climate change is primarily a natural phenomenon and human activities have minimal impact.

- The evidence of human-caused climate change is inconclusive, and it would be too costly to implement measures to address it.

- Climate change is a natural occurrence and government regulations will harm businesses and lead to job losses.

- Climate change is a natural process and human influence is minimal, so climate policies are unnecessary.

- Climate change is a natural phenomenon and its impact is exaggerated.

- The impact of climate change is exaggerated, and policies to address it are unnecessary and detrimental to the economy.

- Climate change is a natural occurrence and not caused by human activity.

It should be no surprise to anyone, that a model trained for truthfulness using a set of scientific facts, showed a bias against these statements.

2 more comments...

No posts

Ready for more?