asail77's comments

asail77 · 2025-09-10T15:46:06 1757519166

A good model for planner seems pretty important, what models are best?

saqadri · 2025-09-10T16:05:54 1757520354

OP here -- I think the general principle I would recommend is using a big reasoning model for the planning phase. I think Claude Code and other agents do the same. The reason this is important is because the quality of the plan really affects the final result, and error rates will compound if the plan isn't good.

haniehz · 2025-09-10T15:53:52 1757519632

based on the article, it seems like a good reasoning model like gpt5 or opus 4.1 might be good choices for the planner. I wonder if the gpt oss reasoning models would do well

diggan · 2025-09-12T18:06:46 1757700406

Personally been using GPT-OSS-120b locally with reasoning_effort set to `high` and it blows pretty much every other local model out of the water, but takes a lot of time for it to eventually do a proper content reply. But for fire-and-forget jobs like "Create a well-researched report on X from perspective Y" it works really well.

cyberninja15 · 2025-09-12T19:45:28 1757706328

what machine are you running GPT-OSS-120B on? I'm currently only able to get GPT-OSS-20B working on my macbook using Ollama

koakuma-chan · 2025-09-12T17:15:34 1757697334

Gemini 2.5 Pro is also a great reasoning model, I still prefer it over GPT 5

luckydata · 2025-09-12T19:30:32 1757705432

Gemini is great, it's just incredibly clumsy at tool use and that's why it fails so often in practice. I'm looking forward to the next version, it will for sure address it, it's a big issue internally too (I'm a recent xoogler).

reachableceo · 2025-09-12T20:46:07 1757709967

Yes it really is horrible at using tools. Codex is way better (even better than Claude code ). Gemini is great at doing audits and content (though I’ve switched to codex for everything all in one).

PantaloonFlames · 2025-09-12T20:26:16 1757708776

Can you elaborate on “clumsy at tool use”?

luckydata · 2025-09-13T01:17:17 1757726237

have you ever witnessed how sometimes Gemini makes multiple attempts at writing a file only to give up and start chanting "I'm worthless...".

That's tool use failure :)

koakuma-chan · 2025-09-12T19:54:44 1757706884

I'm excited for the next version!

asail77 · 2025-08-18T16:57:49 1755536269

Give it a try for yourself. It's free!

We have been working on this problem since 2020 and have created an trained an ensemble of AI detection models working together to tell you what is real and what is fake!

Lerc · 2025-08-19T10:41:13 1755600073

I tried, It required making an account to use.

In this day and age, everybody realises that forcing people to make an account does not count as free. It is paying with personal information.

bpcrd · 2025-08-19T19:04:57 1755630297

I completely agree! Please see the bottom of our post where we offer free access in two ways:

1) Email up to 50 files to yc@realitydefender.com, we’ll scan them for you, no setup required

2) 1-click add to Zoom/Teams (via Appstore) to try detection live in your own calls immediately

bpcrd · 2025-08-19T19:05:12 1755630312

https://marketplace.zoom.us/apps/OYu4CZuRSwy_ieJ-6xKcrA

asail77 · on March 25, 2022

Great discussions! Some thoughts:

First, there are multiple solutions to this problem, all need to be explored and many will have a part in the future. Bad actors will do everything they can to find a way around every solution.

Second, for the hashing approach, have a look at "Perceptual Hashing", it's a way to hash content like a an image, even if the resolution changes.

Where to store the hashes? A centralized server is probably fine, but there is always a risk of a bad actor exposing it somehow. A secure blockchain can work better. But if you go that route, might as well go with the most secure blockchain. POW is generally more secure than POS. And currently the most secure blockchain is Bitcoin. So one solution is to batch hashes together and write them with a Bitcoin transaction in some cadence.

-Ali

asail77 · on March 23, 2022

Thank you!

asail77 · on March 23, 2022

Great points! This is a problem that must be tackled from multiple directions. Different tools that flag content and general education.

We plan to guide our users in this way, incorporating these different tools and providing general education.

asail77 · on March 23, 2022

Thank you! We hope we can help make the world a safer place!

asail77 · on March 23, 2022

Awesome! We will look them up! And happy to chat with you. You can also email us at ask@realitydefender.ai

Thank you!

asail77 · on March 23, 2022

Great discussion. Many have asked the same questions.

We hope a standard will be created and used by all digital content creation tools. But this will take time. And even then, bad actors will not be deterred. They will always find ways to create fake content and pass it as authentic. We want to be there to fight them every step of the way!

rockemsockem · on March 23, 2022

There is a standard for this actually. Hot(ish) off the press: https://c2pa.org/

asail77 · on March 23, 2022

We hope a standard will be created and used by all digital content creation tools. But this will take time. And even then, bad actors will not be deterred. They will find ways to create fake content and pass it as authentic. We want to be there to fight them every step of the way!