Training our own AI models
by tartieret on 5/27/2026, 4:08:42 PM
https://posthog.com/blog/training-ai-models
Comments
by: JimDabell
“Opt-in by default” is an oxymoron. If it’s default then I haven’t opted into anything. It’s been <i>enabled</i> by default.
5/27/2026, 4:36:08 PM
by: aabhay
This should be a lesson in bad communication. Not being clear about what’s being trained on is a huge mistake. And this announcement really puts into focus the drawbacks of PostHog’s cringe-forward brand ethos.
5/27/2026, 7:45:05 PM
by: Waterluvian
PostHog was a system we set up once, generally don't think about, and review from time to time, providing some occasional value. It <i>was</i> mostly harmless to leave around.<p>But it's apparently <i>yet one more thing</i> we have to be actively suspicious of as it defaults towards an intolerable state. So it's easier to just rip it out of the system and move on.
5/27/2026, 5:03:07 PM
by: sixtyj
Most companies would bury this change in a deceptively boring T&Cs update, but we value transparency, so here's what you need to know in an internet-friendly numbered list:<p>1. Users on our EU cloud instance are opted out by default<p>2. So too users with agreements that prevent training (e.g. BAA, MSA, or similar)<p>3. All other users on our US cloud instance are opted in by default<p>4. We will anonymize all data before it's used for training<p>5. We will only use data that already exists in your PostHog instance<p>6. We will do all the model training ourselves, which means...<p>7. We won't sell or send your data to third-party model providers<p>8. You can opt out at any time via your org settings in PostHog (admin access required)<p>9. Training won't start until June 29, so there's plenty of time to decide
5/27/2026, 4:15:21 PM
by: 48terry
> Why this is opt out, not opt in<p>> Put simply, because otherwise we will not have enough data to train a model that's actually useful.<p>Hmm, when asked to opt-in to giving their data away for yet another AI non-service, people don't want to. That's strange! The only way to get their data is to assume you can take it and force them to tell you to stop. Wonder what that could mean? Oh well, it's a mystery no one will be able to solve.
5/27/2026, 6:57:21 PM
by: frankest
What a great reminder to build my own analytics and self-host. PostHog just lost a customer. They could easily send an email to each customer asking if we want this. The assumption means they have no product intuition about their own customers, let alone the customers of their customers. Bye.
5/27/2026, 4:45:17 PM
by: infecto
Thanks for posting. I had been on the fence about switching for the past few months. The new AI products combined with the weird UIs had been irking me for a while. This is the final nail in the coffin. Opt-in is a terrible business model imo.
5/27/2026, 4:51:35 PM
by: tines
“Opt-in by default” = opt-out?
5/27/2026, 4:33:20 PM
by: brauhaus
Every day I'm more glad about EU legislation, that's all I have to say for now
5/27/2026, 4:45:21 PM
by: thecatapps
It's probably very obvious by now, but there's something to be said about companies with the "SF Quirky" vibes:<p>- The OS Redesign<p>- "Sexy Legal Documents"<p>- Emails with "<relevant hedgehog meme goes here>" as the subject line<p>- Having a merch shop with action figures of your CEO<p>It works both ways. When you're looking for adoption and making very pro-user moves, I guess it can be a benefit. However, when you're now looking to grow revenue and making very anti-user moves, it adds insult to injury.<p>I'm the last person to say that tech "shouldn't be fun" or something overly broad like that, but if your messaging doesn't match the decisions of leadership, you're gonna have a bad time.
5/27/2026, 5:33:33 PM
by: Dave_Rosenthal
They say, "our goal here is to improve PostHog as a product for our customers, not to expose or sell models trained on your data" but then don't actually list that as a limitation in the bulleted points.<p>AFAICT this now gives them default permission to train an LLM on your code (as PostHog telemetry data is inextricably tied to your code), use it, and even sell it if they wanted to (as it's not your data anymore, it's their model). Yikes.
5/27/2026, 5:44:29 PM
by: freshnode
Why won't companies explain what anonymisation means for them?<p>PostHog has unfettered logged-in access to some sensitive stuff. What steps are they actually taking to scrub sensitive data from my replays before they are used to train a model?
5/27/2026, 5:03:12 PM
by: abustamam
> Why this is opt out, not opt in<p>> Put simply, because otherwise we will not have enough data to train a model that's actually useful.<p>AKA we won't be able to make as much money if we require you to give us permission to use your data.
5/27/2026, 5:25:44 PM
by: the__alchemist
How much are they paying the users?
5/27/2026, 5:19:13 PM
by: rad_val
All of them do if you don't do something about it (e.g. migrate to self-hosted solutions). Trusting a ToS in 2026 is as naive as it gets.
5/27/2026, 5:48:48 PM
by: stevoski
I’ve been evaluating PostHog for our company.<p>I’ve now made our decision. We won’t be using them.<p>If they are going to position themselves as the non-slimy no-BS guys, they can’t pull this nonsense.
5/27/2026, 5:54:52 PM
by: ASinclair
Mostly unrelated but the name of this company makes me think it's a Dick-Pics-as-a-Service provider.
5/27/2026, 5:14:05 PM
by: staticautomatic
Friendly reminder that you don't have to enable PostHog replays at all. I have a site lightly instrumented server-side with the slim bundle, and I'm still gonna double check my account settings but I'm pretty sure it's not even capable of doing the replay telemetry.
5/27/2026, 6:59:52 PM
by: mrcwinn
Gross.<p>They’ll use your product and your data to later sell a product back to you.
5/27/2026, 5:14:08 PM
by: jen20
Perhaps if they hopped on a quick call for five minutes with some customers, they'd realize quite how little appetite there is for putting up with being opted into things automatically in the US but not in the EU.<p>As an aside, this also means the EU rules are working.
5/27/2026, 4:48:22 PM
by: bigstrat2003
This is the fastest way possible to ensure I will never do business with you, or stop doing business with you if I already am.
5/27/2026, 4:40:58 PM
by: gyoridavid
I feel that the US should step up their legislation game and make sure these companies can't retroactively make rules to steal their users' data. I know it's trendy to hate the EU but their legislation actually protects the users, and not the companies' interests.
5/27/2026, 5:27:21 PM
by: tartieret
I initially used PostHog as an alternative to Google Analytics with more privacy. Now they want to use the data for a business purpose. Working hard towards enshittification?
5/27/2026, 4:08:42 PM
by: calmbonsai
LOL. You stay classy PostHog.
5/27/2026, 5:14:42 PM
by: Henchman21
You can’t “opt in” to something that is the default. The choice is made for you, and when the choice is made for you, you haven’t opted in or out.
5/27/2026, 4:40:24 PM
by: TZubiri
Today I was thinking: if I started a company in the LLM tooling space, I would write into the company mission in the incorporation documents that client data will not be used for training.<p>The temptation and the value are too great, and the opt-in/opt-out consent thing ends up being a fuckery where the company tries to trick the user into allowing them to take a look into the data, presumably because they are selling the product at a loss and need an alternative revenue model.<p>Just make it impossible from the get-go. The fine print would be that the data can be shared out-of-band explicitly, in an email, or if explicitly copy-pasted into a support chatbox, but there would be no mechanism for us to read the data from the databases, much less from the client.<p>I don't mean it would be an air-tight mechanism like Signal or ProtonMail. If a court order asked us to produce client info, we would still reserve the right to produce the data, but only exceptionally, and definitely not for training models.
5/27/2026, 4:45:44 PM
by: dzonga
Another would-be excellent product company destroyed, or slowly being destroyed, by VCs and the never-ending chase for 'growth'.
5/27/2026, 5:25:11 PM
by: mikkelam
The enshittification has begun. Time to move on!
5/27/2026, 5:25:16 PM
by: slopinthebag
PostHog better transition to an AI company soon because they are one of the SaaS companies that are absolutely cooked by vibe coding. What it does is extremely amenable to LLMs and it's also non-critical for a business, making it an excellent candidate for replacement by in-house solutions. And if it means never having to use their website again that's even better.<p>I wonder if they regret going open source, considering people will be replacing them with LLMs that have surely been trained on their code.
5/27/2026, 4:41:34 PM