Asked our head of sales if putting client addresses in ChatGPT was data sharing. She looked at me like I was the idiot.
Posted by shangheigh@reddit | sysadmin | 430 comments
Had a weird convo with our head of sales last week. She was showing off how she uses ChatGPT to polish client emails. The prompts had full names, deal sizes, internal pricing strategy. One even had a client's home address.
I asked if she thought of that as sharing data. She looked at me like I was slow and said no, she’s just asking for help with wording.
Training clearly isn't landing. People genuinely don't see it as data sharing. Policy posters aren't fixing this one.
Call-Me-Leo@reddit
We need an AI plugin on computers that will detect when people are putting sensitive information or data into AI chatbots. Bring back Clippy and have it say “Are you sure you want to upload this to the cloud? It looks like private information.”
ribsboi@reddit
It's called Purview
Icy_Performer_9675@reddit
you use it ?
ribsboi@reddit
yes
khantroll1@reddit
How do you leverage purview in this way?
We have a blanket policy against AI, so I haven’t had to keep up with it.
bbanda@reddit
Label sensitive data and build policies around it. If they can't get it out of the environment at all, they can't get it into unapproved tools.
Aggravating_Refuse89@reddit
That would only work if you run a really tight MS ship. Even then, it's more than Purview. You're talking about DLP and classification, which Purview includes. Purview may be a piece of it, but it's not a magic bullet, and most IT shops don't have the resources to go after this.
ConflictResident5253@reddit
That's not an accident. According to our MS admins, the answer to defective MS shit is always to buy and use more MS shit. It's a way to get you to sunk-cost more and more of the IT budget into MS products. Trust me, even if you run the tightest MS ship in the world, you'll still have a leaky boat.
ribsboi@reddit
What @bbanda said + use Defender for Cloud Apps to block all AI apps.
Aggravating_Refuse89@reddit
Must be nice to have that
psiphre@reddit
reddit doesn't use @ notation
postbox134@reddit
I knew what they meant
VividVigor@reddit
How do you apply a Purview document label to copy-n-paste?
ribsboi@reddit
It's not actually working with labels, because clipboard data can't carry labels. There are two very useful policy actions for this: "Paste to supported browsers" and "Copy to clipboard". You design a policy with the types of sensitive data you want to catch, attach these actions, and decide whether to block or just audit the matches.
See https://learn.microsoft.com/en-us/purview/endpoint-dlp-learn-about#endpoint-activities-you-can-monitor-and-take-action-on
VividVigor@reddit
I was not aware Microsoft Defender for Endpoint could enforce actions on the clipboard. Thanks. I thought a secure browser or TLS inspection on a cloud gateway were the only two ways to detect and block this.
mrmugabi@reddit
This right here. It worked surprisingly well when I rolled it out a while back.
belzaroth@reddit
This sounds like the blind leading the blind.
After all, who checks the AI plugin? My ADHD has me here thinking you need AI to watch the AI that watches the AI, and another to watch that one...
belarm@reddit
Yeah, this seems like a fool's errand. What is sensitive varies greatly and depends on context that the model is just not going to have, even after scanning all your docs. That "filter" is also now a honeypot full of sensitive data.
Away-Sea7790@reddit
SentinelOne has this plugin.
RoseRoja@reddit
Look into the prisma browser from palo alto networks
Morkai@reddit
DLP tools can definitely do this.
PotatoOfDestiny@reddit
My company runs an "internal" AI tool that combines a couple of different LLMs with regulatory data privacy requirements (I work in healthcare), and we straight up blocked every single other one in the firewall. Letting users access random public tools is a really bad idea.
Optimus_Krime555666@reddit
That's unfortunate.
SentinelOne was the biggest piece of shit I've ever encountered in 15 years in IT, so hopefully the browser extension is better.
zvii@reddit
Exact opposite here, S1 works great. Paired with BeyondTrust for PAM. A little tuning up front, but it's very clear why things get flagged or killed/quarantined. A few exclusions and false positives, but in 6+ years never anything let through that I didn't want or wasn't alerted on. I'm going to say it's probably an implementation or admin issue more than anything.
DeifniteProfessional@reddit
I reached out to SentinelOne to get pricing and they sent my request to an MSSP who would give me S1 Lite as part of an MDR service. Frankly it seems like they don't actually want business.
snatchpat@reddit
$20 says you flipped the switch but never tagged your data for DLP to actually “work”. Nobody does.
VividVigor@reddit
How do you tag copy-n-paste? Detection engines are getting better (with help from AI 🤦♂️)
lotekjunky@reddit
we use zscaler
Annonimbus@reddit
If you use Copilot you can force that the document you upload needs to have a Purview classification and only certain levels of confidentiality are allowed to be uploaded.
Malicious intent isn't stopped by that as you can just classify the data as public or copy and paste it into the box.
But for a case like OP where people are just ignorant it helps.
Morkai@reddit
I would refuse to take that bet because you're quite likely right. Unfortunately there's been a significant amount of turnover in our team over the last ~5 years (I technically have the longest tenure currently, and I've been here less than a year).
There are many things we're still reviewing, consolidating, and cleaning up. Every other week there's another of what we've come to call "landmines": some undocumented system, platform, license renewal, or subscription that we have to drop what we're doing to address immediately.
snatchpat@reddit
That’s not a you problem. Leadership gets in a bind over exfiltration and suddenly it’s top priority just like all the other priorities. They’re shmucks for not supporting the IT operation to begin with. A properly empowered technical group would have this solved five years ago. Tell them to eat shit while you train your agents to outdo them.
Morkai@reddit
Unfortunately it's a me problem insofar as something breaks and it's largely on me to fix it as best as possible, assuming I have the tooling or the money to do so. It's a small company and an even smaller team, so there are not many other resources to call on.
Awkward_Pingu@reddit
We have something integrated into all the microsoft apps that detects PII and sets a confidentiality level from 1-4 on it. Also the Business level AI app.
Mission_Process1347@reddit
These tools definitely exist
dinkleberg01@reddit
Torii does this iirc. There's an extension for browsers. it will alert you that you are sending sensitive data via xyz platform and prompt you change it.
iamoldbutididit@reddit
What time we live in.
We used to call program that recorded everything a user typed spyware... Now we have to buy the same type of product to keep our data confidential.
Ansible32@reddit
I love that people think this is a good way to keep data confidential and not a prime security hole in and of itself.
lotekjunky@reddit
it's just matching patterns locally in your browser. It doesn't store it.
Ansible32@reddit
And how do you know? have you read the source code? That's one code change away from deliberately exfiltrating things.
lotekjunky@reddit
because I've watched the payloads in .har files and through fiddler. you could do the same if you actually cared to understand how shit works
MrHaxx1@reddit
I don't know how Torii did it, but it could very much be done locally.
DeifniteProfessional@reddit
Claude teams has a global DLP/enterprise data scanning thing apparently. I've not yet used it (we're still onboarding it) but hopefully that's actually decent
Vesalii@reddit
Our manager had a meeting with a company that has this as a browser extension. If it detects you're sending data to an AI it substitutes the data for fake data.
It would cost us 30k per year 😂 No way that's happening.
UpsetMarsupial@reddit
People will click anyway. Just like "This site might be insecure. Do you wish to continue?" type questions. Just like "There are updates to install. Do you wish to update or postpone". Users ignore the essence of the warning and instead click to progress with whatever task they have in mind.
AppropriateSpell5405@reddit
I think MS rolled out a GPO for detecting this, at least from within Edge.
Aroe2k@reddit
We use Netskope for this, it’s configured to only allow AI usage with approved tools and domains.
TheBigBeardedGeek@reddit
Crowd strike does this on Windows I think. Ultimately what we need as orgs is better DLP implementation
FriendlyITGuy@reddit
These do exist. We were testing out Forcepoint Remote Browser Isolation specifically for this but unfortunately it was nothing but issues for the testers.
If anyone has any other suggestions I'd be interested in hearing them.
neon___cactus@reddit
Sentinel One has a browser plugin called Prompt Security that lets you see what people are putting into the AI tool and block based on category.
AbfSailor@reddit
Zscaler can do this
Packagedpackage@reddit
We are encouraged to use AI for as much as possible. The only thing I imagine not being entered is W2-related stuff from HR. People put account numbers and such into it. Nobody cares except the Karens.
Previous-Low4715@reddit
This is literally what Purview does.
maxis2bored@reddit
Plenty of tools do this.
ReptilianLaserbeam@reddit
Compliance and security suites can do that. We have automatic alerts with Purview when someone uploads files into generative cloud apps, even alerts triggered by prompts that include sensitive information.
Tarwins-Gap@reddit
Checkpoint has this
cnrdvdsmt@reddit
Clippy comeback would be elite. We use LayerX, which does basically that minus the paperclip.
shangheigh@reddit (OP)
We would really appreciate one of those.
kennetheops@reddit
I'm building this tool right now. It's early, but we are seeing a ton of interest.
postbox134@reddit
Proxy rules: block any non-onboarded AI tool. Give them a helpful link to approved tools.
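As a toy sketch of that allowlist idea (every hostname and the intranet URL here are invented examples, not a real config):

```python
# Toy sketch of a proxy-style allowlist: onboarded AI tools pass,
# everything else is blocked with a pointer to the approved list.
# All domains and the intranet URL below are hypothetical examples.
APPROVED_AI_HOSTS = {
    "copilot.cloud.example.com",   # example: onboarded enterprise tool
    "internal-llm.example.com",    # example: self-hosted model
}

APPROVED_LIST_URL = "https://intranet.example.com/approved-ai-tools"

def filter_request(host: str) -> str:
    """Return 'ALLOW' for onboarded tools, else a helpful block message."""
    if host in APPROVED_AI_HOSTS:
        return "ALLOW"
    return f"Blocked: {host} is not an onboarded AI tool. See {APPROVED_LIST_URL}"
```

A real deployment would do this at the proxy/gateway layer, but the decision logic is the same.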
Previous-Low4715@reddit
There’s a reason the enterprise versions advertise that they don’t train on the data
shangheigh@reddit (OP)
Either way, I wouldn't really want to trust customer PII with any model.
Krigen89@reddit
If your customer data is in SharePoint, then you can put it in Copilot. Same TOS.
Same stuff with Google Drive and Gemini.
Legal_Situation@reddit
Just tossing this out there that some features may have different TOS, such as things within Google Labs. Currently I think Google Flow (AI Video) doesn't use the same TOS for example.
moonski@reddit
Thing is, Google/MS could well say "we don't use this to train AI" in their TOS and just use that data anyway... What's going to happen? They get fined a few hundred mil 5-10 years down the line? They don't care.
PowerShellGenius@reddit
That is true, but it is true whether it is "AI" or not. If you do not trust Microsoft not to scoop up data for AI training in direct violation of a binding contract, you cannot store any data where they technically could do that. This includes SharePoint, OneDrive, Exchange Online, etc.
AI may be the motive (training) for misusing customer data. But there is zero correlation between whether the service you input the data to is Copilot, and whether Microsoft can be trusted to honor a contract and not misuse it. If they are stealing data for AI training they would definitely train it on your Teams chats as a source of Natural Language samples...
So if you are going down the "what if cloud providers are blatantly breaking their own ToS" rabbit hole, you're back to everything on prem.
moonski@reddit
Would it surprise you if they'd already done that?
Krigen89@reddit
Maybe. Probably.
On personal accounts, with different TOS.
ConflictResident5253@reddit
No one has ever enforced a contract against Microsoft successfully, to my knowledge. What would you do, sue 'em? They'd drag it out for decades like they did the class action about fraudulently pushing Office purchases in Canada.
And what are you gonna do, NOT use Windows? Microsoft is unenforceable and judgement-proof.
lotekjunky@reddit
This is BS. MS provides compute and AI for highly regulated industries. They do not retain your information, as it would make them legally liable. If you have any proof of this ever happening, please post it. Otherwise, stop spreading misinformation.
moonski@reddit
Yeah just implicitly trust the mega corporation that has routinely done all sorts of illegal stuff in the past. They'd never lie again right? Or do something they shouldn't to get competitive edge
lotekjunky@reddit
so you got nothing and continue to spread misinformation.
moonski@reddit
oh yeah bro let me just dig up all the evidence to prove they are doing this right now... did you actually expect that? mate
ConflictResident5253@reddit
I mean, they fake contractual obligations all the time in the most highly regulated industries on the planet. Like, if you fake FedRAMP and no one gives a fuck at the end of the day, then no customer should believe anything you promise, ever again.
Justgetmeabeer@reddit
"I know these companies actually have been all caught in lies, antitrust suits, and lawsuits for their entire existence, and there is ultimately an insane profit motive for them to use your data anyways, but idk guys. They say they are telling us to trust them, I can't see any reason not to"
lotekjunky@reddit
you don't understand the difference between consumer and enterprise services.
MagicWishMonkey@reddit
The number of idiots in threads like this always surprises me. The fact that some rando IT guy thinks it's OK to give legal advice on something they know nothing about is just weird. Surprised they haven't been smacked down by OGC or senior leadership for not routing questions like those to the people who are actually capable of giving an informed response.
lotekjunky@reddit
I'm the guy the lawyers call when they need to understand how something works. It's my job to work with them to safely enable AI. I spend way too much time with the lawyers and compliance goons.
MagicWishMonkey@reddit
They call you to understand how something works BEFORE contracts are signed, once the agreement is in place it's not your rodeo.
I say that as someone in the same exact position, and it frustrates me to no end how many supposedly intelligent IT people seem to think ChatGPT/Claude/etc. are somehow different from any other enterprise tool. I did a demo of some agentic workflow automation I created for my team and it freaked a lot of people out, and it took a lot of time for me to do damage control with the CISO. Super annoying.
ConflictResident5253@reddit
Dang, son.
ConflictResident5253@reddit
They faked FedRAMP and nothing happened. It follows that they fake everything else too.
Legal_Situation@reddit
Technically, yeah. But then again, you could kind of say this about anything. They'd also likely get a large number of lawsuits from companies with their own legal teams and the capital to take up that fight, so it's not as cut and dried as cartoon villainy.
Really this is more about making the legal ramifications of taking an action like that actually have teeth. US Privacy laws aren't great based on my layman's understanding of them. That said, who knows what that would look like when corporations feel their IP was threatened by it.
I was mostly just mentioning this because I wanted to add the nuance that I know of to the conversation.
IlIlllIIIIlIllllllll@reddit
MS provides services like email for hospitals. It's possible that software companies that handle patient data are just stealing it instead. But what are you going to do, personally audit every bit of enterprise code your company provides you?
Krigen89@reddit
Same thing with SharePoint and other parts of their platform.
reillan@reddit
We have a copilot enterprise account and that's all we're supposed to be using.
ConflictResident5253@reddit
Ain't it interesting how this pinky-promise on paper keeps customers from buying other products instead? Maybe that's the scheme.
CernerBurner2000@reddit
Isn't Copilot Enterprise different though? I don't use Copilot often because it is terrible, but I have a company tab and a public tab. I thought the company tab had access to the data in our tenant and was not used to train public AI models?
Krigen89@reddit
Not really.
Public one is the garbage personal one, which might train on your data.
Enterprise is "paid" through your M365 license, so they don't train on your data. It does not have access to your tenant's data initially though.
If you want it to be able to access your data, (and be able to summarize your Teams calls, and other stuff) you need to add the optional Microsoft 365 Copilot license.
Ferretau@reddit
That's dependent on the Copilot you're currently plugged into. Depending on where in the OS you use Copilot, the TOS it operates under can change.
Krigen89@reddit
Not when logged in with a business/corporate licensed account.
Previous-Low4715@reddit
Wish people understood this, plenty of reasons to be alarmed but this one is not high on the list. It’s been that way since copilot was “bing chat” lol
nyokarose@reddit
I mean, most of us trust that data with cloud providers. Can you help me understand why models are different? (Genuinely curious).
ConflictResident5253@reddit
Models ingest (your) data and then potentially regurgitate it to strangers.
nyokarose@reddit
Companies like Microsoft have specific corporate data protection guarantees that they don’t allow this with the models they’re running. They could just as easily scrape all your Outlook emails in M365 and share with a third party… but we trust them not to do that. I’m not saying that I trust Microsoft to always do the right thing, just trying to understand why this risk is different than any other time we give cloud providers access to our data.
Are you suggesting that the model is doing this in the background undetected by Microsoft and sending it back to the model creation company? That seems like data they’d notice leaving…
ConflictResident5253@reddit
No, I'm saying Microsoft doesn't even seem to read its own contracts, ignores their obligations to their customers, and does whatever it wants. There's a visible history of just not delivering on obligations and not caring.
Plus, the JET-derived database that backs M365 data doesn't have the kind of architecture that can separate sensitive and non-sensitive data. There's no way this promise has the technical capacity to be realized.
They say a lot of things. Doesn't make it real.
nyokarose@reddit
Which then amazes me that nearly everyone has their data in M365, with or without LLMs.
Thank you for helping articulate some of the holes!
Tetha@reddit
German data protection laws, and I think the GDPR as well, have this idea of "data frugality": frugal use of personally identifying data. This means you should always be questioning whether data has to be stored, processed, or, even worse, handed to third parties at all.
I think that is a very good wording and mindset to get into: Does this improve the answer from an LLM? Like, if I want it to check an email I am writing, does something like a personal address or a phone number actually increase what the LLM could improve or analyze? If not, it should not be included by principle.
On the other hand, in production, customers pay us to store this kind of data. So we have to.
nyokarose@reddit
Yep, I work with EU so am familiar with GDPR, you’re right. I also know that in many cases it’s impossible to strip some identifying information when using a SaaS service; the European Council themselves use M365 (and ostensibly copilot along with it, to read emails…. Which all have identifying info in the signature…) I expected more than just data privacy from the original comment.
Tilted5mm@reddit
Exactly.
coastsofcothique@reddit
Because in practice, the vendor agrees to security on their end, gets risk reviews, allows the company to secure it per industry standards.
Random LLMs being used all over the place without going through company risk reviews have no organizational oversight. That's the risk.
ls--lah@reddit
Lol have you read the Microsoft 365 Terms of Service? It basically says "We may get hacked, you may lose data, we are not liable".
lotekjunky@reddit
have you ever heard of a Microsoft DPA? https://learn.microsoft.com/en-us/answers/questions/2236249/how-to-sign-a-data-processing-agreement(-dpa)-with
itskdog@reddit
We're talking about approved LLMs the company pays for not to train on the data, and people still being skeptical of that.
Best-Conclusion5554@reddit
Unless the LLM training is somehow bypassing the information security that applies to the company's data when used for 'normal' purposes (applications, analytics, etc) by 'normal' users. I am old and cynical enough to think that may frequently be the case.
TheChance@reddit
For roughly the same reason a storage locker is different from leaving your shit on the sidewalk while an army of robots sort it for you.
NQ-QB@reddit
Because AI bad.
Lambs2Lions_@reddit
We have a BAA among others. Head of legal, compliance, and security all sign off as long as the AI servers are in the USA and data never leaves the states. shrug, at that point it’s not my monkey, not my circus.
MemeMan_Dan@reddit
Really the only "safe" way to do this is via a local model run internally without access to the outside internet.
Kholtien@reddit
Not even a self hosted one?
NetworkingNoob81@reddit
Only if completely disconnected from the internet, then maybe.
Kholtien@reddit
Why disconnected? They’re more useful when they can search the internet
gameoftomes@reddit
It really is about intent. If I put customer PII in, I don't want a random tool use to let the model push the PII to the internet.
OpenAI just released a privacy model that detects PII. You could use that and some code to filter actual PII and put placeholders in.
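As a rough illustration of that filter-and-placeholder idea, here's a minimal sketch using plain regexes rather than any vendor model (the patterns and names are invented for the example, and real PII detection needs a proper classifier):

```python
import re

# Invented example patterns -- these regexes only sketch the idea;
# real PII detection needs a proper classifier.
PII_PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "US_PHONE": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}

def redact(prompt: str):
    """Swap anything that looks like PII for a typed placeholder
    before the prompt leaves the machine; also return what was found
    so the matches can be audited."""
    findings = []
    for label, pattern in PII_PATTERNS.items():
        for match in pattern.findall(prompt):
            findings.append((label, match))
            prompt = prompt.replace(match, f"[{label}]")
    return prompt, findings
```

The `findings` list doubles as an audit log, which is the same block-or-audit choice the DLP tools discussed above offer.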
lotekjunky@reddit
Azure has had PII detection on their OpenAI models for years now.
fadingcross@reddit
Are the emails this person is rewording with ChatGPT in Exchange Online? If so, you're already sharing the data, so why do you care if it's in ChatGPT?
plinkoplonka@reddit
No, I wouldn't either.
alochmar@reddit
Pinky swear!
ArbitraryMeritocracy@reddit
Just like 23andMe wasn't going to share or sell your data. Google once even had the motto "Don't be evil".
Previous-Low4715@reddit
The same TOS for data covers SharePoint and Onedrive. They want the EU to use it, where there is some modicum of respect for data privacy.
ConflictResident5253@reddit
EU doesn't believe it tho. They're all adopting divestment plans now.
Previous-Low4715@reddit
That’s about digital sovereignty which is slightly different, but very real. I’ve been asked to look into it as a possible project too.
Finn_Storm@reddit
But thats not totally true. Labs and Flow have a different TOS
guareber@reddit
No one wants to be evil, that's the basic reason PR firms exist.
HermyMunster@reddit
No one wants to be perceived as evil, that's the basic reason PR firms exist.
Clyzm@reddit
PR firms exist literally because they do want to be evil but don't want to be perceived as evil.
porkchameleon@reddit
The full version was "Don't be evil like that" /s
Adventurous-House-32@reddit
Also the wifi password on the gBus if you're ever cruising down 101 in NorCal
pdp10@reddit
It was supposed to be, "Don't be Microsoft", but counsel said not to use a trademark.
Ubuntu bug #1 was "Microsoft has a majority market share".
Yuugian@reddit
That and $12 will get you a Starbucks. Y'all can trust them if you want, but I obfuscate the name of the org I'm signed in with.
guareber@reddit
The problem is that from your org's perspective, it doesn't matter if it's true - it only matters if the contract say it's true. Your org doesn't care about anything other than its liabilities.
MagicWishMonkey@reddit
That, and violation of contract would give the company grounds to sue. It's crazy how many people here think OpenAI is just ignoring their contract agreements. If that were true and it became public, the lawsuits would sink the company. There's a reason that almost never happens.
ConflictResident5253@reddit
Blatant contract violations happen all the time, though. No one holds vendors accountable, ever. They throw all their capital into the Legal and Marketing departments to help dilute consequences.
I mean, after we found out that Microsoft's security is so fake that they faked FedRAMP, got caught, and DOD just said "oh, well...", I'm not sure why anyone would believe any tech company wasn't lying its ass off all the time.
RememberCitadel@reddit
Anytime I do anything with AI there is a placeholder for anything sensitive in the data. [Customer name], [public IP], and similar make it super easy to run a find and replace later.
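That find-and-replace workflow can be sketched in a few lines (the values in the mapping are invented examples, not real data):

```python
# Sketch of the manual placeholder workflow: swap real values for
# bracketed tokens before prompting, then find-and-replace them back
# into the model's answer. Mapping values are made-up examples.
PLACEHOLDERS = {
    "Acme Corp": "[Customer name]",
    "203.0.113.7": "[public IP]",
}

def to_placeholders(text: str) -> str:
    """Replace real values with tokens before sending text to a model."""
    for real, token in PLACEHOLDERS.items():
        text = text.replace(real, token)
    return text

def from_placeholders(text: str) -> str:
    """Restore the real values in the model's reply."""
    for real, token in PLACEHOLDERS.items():
        text = text.replace(token, real)
    return text
```

The bracketed tokens tend to survive an LLM rewrite intact, which is what makes the later find-and-replace reliable.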
sunburnedaz@reddit
Gordon Freeman and black mesa are my go to for test users/dummy data.
Questionsiaskthem@reddit
The right man in the wrong place can make all the difference in the world. So, wake up, Mr. Freeman. Wake up and smell the AI slop.
spittlbm@reddit
Reminds me that I need to find cs_beta.zip
Trooper27@reddit
This is the way.
Obscure_Marlin@reddit
This is the way
Gh0st1nTh3Syst3m@reddit
Think about the profile other FAANG companies have already built on you, though. Even with placeholder data, if you were a high-value target they could at minimum infer methods of operating, business processes, etc. Just brainstorming, not really calling you out. Once you are online, unless you are on public wifi a state over with a burner device and a cash-only VPN or hacked exit node... then there is a profile out there on you with some company lol
Bocchi_theGlock@reddit
How does one become a high-value target?
This is a key thing I'm always wondering. How many people could even do that if they tried? Like without destroying their life
JasonDJ@reddit
Work in defense sector, you'll be beating off attackers with both hands.
Bocchi_theGlock@reddit
Good example, thanks
Yuugian@reddit
That doesn't mean I have to make it easy for them. And apparently I don't. I have a work profile, a gaming profile, and a private profile. I don't get any of the same ads on them, and the private one gets such generic garbo that there is no way they know who I really am.
they have a profile, and the profile sucks
Ron-Swanson-Mustache@reddit
Doubt
aVarangian@reddit
based
fearless-fossa@reddit
People heavily underestimate how much knowledge you can get by connecting the right data. There is a semi-famous video by a data scientist who used just the dates articles were published and the authors' names to make educated guesses about the internal structure of the newspaper and the people working there: SpiegelMining – Reverse Engineering von Spiegel-Online (33c3)
Only available in German though.
Most of the employees of a company who use a website over a period of time can rather easily be mapped to that company, the prompts they enter then reveal not only a lot about what the company is currently doing, but also where it is headed. If you are able to access data about them at multiple points - eg. all the interfaces where Microsoft is listening, this doesn't start and end with just CoPilot - you can get a scary amount of data passively.
Previous-Low4715@reddit
I was able to pull user passwords from a copilot agent by putting one on top of my service desk’s Sharepoint site where they were stashing user passwords in an excel spreadsheet. That was a fun call to my risk manager
danstermeister@reddit
You wouldn't say that TO a FAANG company, because you'd assume they would keep everything internal somehow, and better yet, feel they DESERVE to.
But somehow you tell smaller companies here they should give up and lol?
RememberCitadel@reddit
Oh, I meant placeholder data inside a paid enterprise instance.
7fw@reddit
Copilot already has access to my entire pile of data from Exchange, Teams/SharePoint, OneDrive, everything. I don't put in names, but as soon as I send it, or save it, it has access. I don't trust it, but that's up to Cyber Security
I share no data with AI in my personal life.
Happy_Love_9763@reddit
This is what I do: use fictional names and corporations, like Homer Simpson and Contoso.
Careful-Criticism645@reddit
There's little reason not to trust them on this matter. These companies do not want random data from users polluting their models.
Educational-Wing2042@reddit
I work for a fortune 25 company that deals with sensitive data and our data security team doesn’t trust them, we have enterprise and are still restricted from entering any PHI/PII.
atbims@reddit
What data do you think models are trained on if not random users? 🤔
f0urtyfive@reddit
What would be the value of training on user input data? What does the existing AI that constructed that data learn from training on its own output?
That IS how CLONES of models are trained in China: you can replicate an existing model through its bulk outputs by capturing users through "free" apps. That is NOT how frontier labs train models, because the model would just output nonsense all the time.
Yuugian@reddit
Very specific users from Reddit
sdeptnoob1@reddit
We don't own the company (unless you do, then good on you). By signing an enterprise agreement we did our part; it's legal after that, especially when the C-suite wants it. You can give warnings, but hey, the agreements are there to make it easier to use, and they let us sue if they're violated.
bananenkonig@reddit
There's a reason that a lot of companies are getting internal models. My company has two, one that is only attached to the intranet that is used for company related questions, and one that is disconnected from that one on a separate internal network for quick data searches on program specific documents.
lotekjunky@reddit
No local model has a 1 million token context.
RobbinDeBank@reddit
Not every task requires that.
bananenkonig@reddit
I'm not sure what you're talking about. I don't deal with AI at all. Our system is completely offline. I don't see any mention of tokens though.
tylerwatt12@reddit
He's saying local AI is nowhere near as good as cloud AI when it comes to context. You can either pay a subscription of $100 a month, or invest half a million dollars to roll it yourself.
shikkonin@reddit
Doesn't fix the data protection issues.
Previous-Low4715@reddit
https://learn.microsoft.com/en-us/microsoft-365/copilot/microsoft-365-copilot-privacy
shikkonin@reddit
Good for you. Doesn't fix the issue of (personal) data protection.
Previous-Low4715@reddit
That’s why I compared the personal versions to, I quote myself, “the enterprise versions”.
shikkonin@reddit
Yes, the enterprise versions.
You, as an enterprise, are still responsible for protecting all the personal data you have been entrusted with. That includes to whom and under what circumstances you give that data to third parties.
Previous-Low4715@reddit
I know, I did data protection for an F500 and I’ve spent the last year setting up copilot for a large government organisation. I’m not sure what you’re getting at.
shikkonin@reddit
That, depending on the jurisdiction you're operating in, there's simply no way you can ever use Copilot or any of the other LLMs legally if you provide personal data to them.
Previous-Low4715@reddit
It’s the same handling as any other data in M365 in a GDPR etc bound location, you use a tool like purview with trainable classifiers to identify the data in your environment and enforce data boundaries through DLP and so on. Copilot data does not leave your tenant.
shikkonin@reddit
That's nice and all, but doesn't (always) matter.
Previous-Low4715@reddit
It’s enough for the GDPR compliant government department I rolled it out for.
shikkonin@reddit
Good. However, it would not be legal for my government.
Previous-Low4715@reddit
Pray tell which government has stricter enterprise data requirements than the EU.
bentbrewer@reddit
And allow you to connect to their api for dlp.
Previous-Low4715@reddit
Aye
TSiQ1618@reddit
Something weird happened when I was using Copilot to troubleshoot a simple piece of code. While it's "thinking", it does this thing where it gives usually pointless temporary messages that are supposed to convey what it's doing, like "lining things up", "checking online database", "reformatting output", "adding comments". But it was taking a while this time, cycling through a few different messages, and for a second it said something like "have a potential solution, one moment, checking for any intellectual property legal violations". That was kind of weird. It got me wondering: what information was it accessing that it needed to check for legality? Was it stealing code it wasn't sure it was allowed to share?
Cley_Faye@reddit
General consumer version : "we'll train on your data"
Enterprise version : "we'll pretend we won't train on your data"
It's a good deal, really…
lesusisjord@reddit
Copilot attached to enterprise Microsoft accounts has the same data protection as the rest of the 365 stack, so if I can include it in an email/Teams/SharePoint, I am safe to include it in my Copilot chat.
Is this wrong?
Previous-Low4715@reddit
Correct, though some of the features for compliance monitoring in purview are in preview
lesusisjord@reddit
I’m asking for my personal experience - I don’t want to be responsible for any data-related issues myself, but I don’t manage our 365 tenant, so the organization as a whole is not my concern.
lotekjunky@reddit
we can read all of your prompts, like we can read your email... just remember that :)
lesusisjord@reddit
Wow! For real‽
lotekjunky@reddit
I don't know if you downvoted me for that, but yes, it's true. It's all in Purview. We need access to it because the prompts were deemed "business records", and I work in finance where we deal with legal hold, compliance, and audits...
lesusisjord@reddit
Yes, I did. We are in the sysadmin sub and not tech support.
Previous-Low4715@reddit
You can see whether you’re protected or not, at least in Copilot, by hovering over the protection icon in most Copilot-enabled apps. But like the other guy said, in theory all your prompts are visible to IT or the information management team.
lesusisjord@reddit
I couldn’t care less who sees my prompts. Copilot is for work shit and ChatGPT is for everything else.
I’m only worried about the company data and not being responsible for data loss. I get that shield on the top right of Copilot and that’s enough for me.
lotekjunky@reddit
this is correct
lesusisjord@reddit
Thanks!
jeffrey_smith@reddit
They're saying none of the versions train on your data, to drive shadow-IT enterprise use up. All of them continue to retain data though.
lotekjunky@reddit
r/conspiracy is over there ->
Previous-Low4715@reddit
In Copilot, prompts and interactions are held in user mailboxes so they can be searched for compliance and ediscovery.
Fearless-Assist-127@reddit
Ah yes. Adverts. Famously reliable, transparent and truthful.
Previous-Low4715@reddit
That’s not what “advertise” means. Judging by the replies I’m getting, some Americans can’t process the idea that the EU has strict data sovereignty and processing requirements.
https://learn.microsoft.com/en-us/microsoft-365/copilot/enterprise-data-protection
jimicus@reddit
America has something called the CLOUD act - which basically says “to hell with where the data is held or the corporate structure of the subsidiary that holds it; if it’s held by a company with a US head office, they must hand it over on demand”.
nem8@reddit
Yeah, I feel people forget about this, or don't know about it..
ls--lah@reddit
Countries like North Korea and China were always "unsafe" because they had similar laws. But it's okay when the US does it!
Fearless-Assist-127@reddit
I'm not American. What I do see though is how much surveillance the likes of Microsoft force on everybody, everywhere; deals done between Palantir and the NHS; global databases of identity openly being built and forced on everybody; etc.
We're in a de facto third world war and one of the fronts it is being fought on is data and surveillance. I don't trust any multinational, no matter how many platitudes and unproveable promises they make.
spin81@reddit
I'm in talks with them for a really cool purchase. I'm not saying what but let's just say it will connect Manhattan to Brooklyn!
SideburnsOfDoom@reddit
Sending this data to some third-party service over the internet is sharing it. It's that simple. You have no real control over how that service logs it, stores it, uses it, and trains LLMs on it.
SearchAtlantis@reddit
I mean we still do training about not putting PII or PHI in. Internal IP is fine on the enterprise version.
gryghin@reddit
I remember when Google appliances first landed in the data centers so that we would have that functionality without exposing sensitive IP.
You would think this would be a thing for ChatGPT.
But then again I retired in 2023.
RobotBaseball@reddit
Is this ChatGPT enterprise?
kennetheops@reddit
that is PII leakage, I’m sure of it. The auditors are about to get paid because of AI
sdbrett@reddit
It’s not just from the consumers either. The AI companies aren’t doing a good job with data segregation, like the copilot bug which allowed summarizing documents and emails with data sensitivity labels.
ConflictResident5253@reddit
They actually can't. LLMs don't work that way. If there are sensitivity claims being made, they're necessarily false. LLMs just recognize and replicate patterns. They don't know what they're reading or saying. They don't know whether it's private or what "private" is.
kennetheops@reddit
holy hell that’s big bad
shangheigh@reddit (OP)
Tough truth. What scares me most is that most employees aren't even aware of how much they're putting the company at risk
kennetheops@reddit
Imo AI has set us back 15 years in cybersecurity. We are giving up security for convenience.
Curious on your thoughts on not just PII but also confidential information being leaked
ConflictResident5253@reddit
It's more like 2002, when Windows had no defense at all against network worms. Mixing fake cloud-platform security (O365) with incompetent endpoint OSes (Windows) and adding code that can't do anything BUT ingest and regurgitate language-like patterns made of what used to be your company secrets.
Like, it's an apocalypse and no one cares.
Bogus1989@reddit
lmao,
they will keep doing it till companies' cyber insurance payouts require they have full compliance… I actually bet cyber insurance companies are looking very lucrative right now… they found a new way to not pay out
Aggravating_Refuse89@reddit
Cyber insurance leads to a lot of products being purchased and box-ticking theater. It's super lucrative for security vendors and companies like KnowBe4, which are lip service to doing something about email, which is by far the biggest successful attack vector. Cyber insurance has done some good, such as forcing MFA on a lot of things it needs to be on. But for the most part it forces people to buy products they won't really use or know how to use, just to check the box.
The downside of all this is that poor implementation of a lot of these products leads to security fatigue and user confusion. We are sending them legitimate emails that look like phishing and then wonder why they get phished.
Point is, all this worry about AI exfiltration is not wrong, but it really is a much much lower actual risk than a lot of things people do every day.
I am all for security. But cyber insurance style box ticking is a lot of theatrics and $$$ for the vendors.
I swear KnowBe4, which is the industry's weak-assed attempt to solve the phishing problem, is in cahoots with the insurance issuers. Otherwise there is no way that company would even exist. Yes, I said phishing is the biggest risk and then called something that addresses phishing useless. I know that seems wrong.
AthiestCowboy@reddit
Yes and we will see AI workloads come on prem.
MagicWishMonkey@reddit
If your company doesn't have an enterprise agreement, you get what you have coming to you.
If you do have an agreement, it's not PII leakage or anything of the sort, it's no different than storing something in google drive or sending an email. The fact that so many people think "AI" is some magic thing different than other tools is really confusing to me.
I will say that if you DO have an enterprise agreement and you're giving users legal advice, I would be all over your ass if you were one of my reports. Not only is that not your lane, you have no business giving anyone legal advice, and you obviously have no clue what you're talking about.
Phreakiture@reddit
Thing is, were I a customer, I would be pissed if I found out this was done.
But our modern world is not built on consent. Like, at all. Matters of intimacy are the one major exception to that.
Outside of government, sales and marketing are some of the most invasive, DGAF-about-consent types around. I'd kinda like to sic a PI on every marketing guy to give their lives a metaphorical proctology exam. Let them see how it feels to have someone up their ass without their permission.
linoleumknife@reddit
Yet company leadership pushes employees to use AI for everything they do.
ency@reddit
I'm willing to bet most employees, even if they know, just don't give a damn.
anortef@reddit
Worked at places where the CEO's response when told the risks was legit: "As long as the sales outpace the fines, there is no problem."
ency@reddit
That's always the case for the C-level people. Breaking the law isn't an issue if the fine is less than the profit, ignoring safety issues is fine as long as the payout is less than the profit, ignoring security isn't a problem as long as the cost of the fix after getting called out is less than the profit.
I'm all for capitalism and think it's the best of a lot of bad options as long as it's constrained and regulated. The corporate death penalty has to become a thing and the C-levels need to be held accountable for much of their BS. I'm a huge cynic but I am still fairly optimistic when it comes to people. If the companies started to give a damn and showed it, then the employees would as well. Doing that would go a long way in plugging a bunch of the easy-to-reach holes when it comes to security. But as things stand I don't do shit and I don't say shit when I see issues unless it's gonna affect my role. I'd gladly let the company burn to the ground while making sure the items in my job description are taken care of. It's not my responsibility to make sure others do their part.
Bogus1989@reddit
oh i hate that…
The intent seriously matters.
It is why I vehemently despise Google. They actually believe their shit don't stink… they couldn't be that dumb, right? Microsoft, Amazon, Facebook?
You can't continue just giving the middle finger, guys… you have to pay the piper at some point…
Still can't believe a lawyer out there finally got them. She's smart enough to understand it all, and has proven Google's track record indicates they can no longer be believed to operate in good faith. Love that ruling. They are working out how best to not ruin the entire business. I guess Google just gets to try to convince them it wouldn't work… they said, well, it's your funeral… you can either help mitigate risks or be no part of it at all.
Black_Patriot@reddit
Throw some CEOs in jail and see how quickly the rest make following the rules a top priority...
gandhinukes@reddit
Real businesses that care about PII are paying Microsoft for enterprise Copilot and a guarantee that no data leaks out to train outside LLMs. And that employee is messing up... Lots of trust being given for $$ though
Mrhiddenlotus@reddit
https://openai.com/index/openai-for-healthcare/
kennetheops@reddit
I have heard about it being a pain to get a BAA from them so most folks have gone through the hyper scalers
WRB2@reddit
Natural stupidity hides a great number of issues with artificial intelligence
99infiniteloop@reddit
It is stunning how often people can't see ChatGPT and the like as either a third party or vendor when it comes to privacy and security concerns. Sure, it can do more than other traditional SaaS tools, so it's "different." That doesn't mean your data isn't going somewhere.
missingcolours@reddit
So... under GDPR and many similar laws there is the concept of a "data processor". This is someone who processes your data on your behalf. This could be anything from using cloud apps to outsourcing data analysis tasks. GDPR requires certain contractual agreements between data owners and data processors.
Enterprise versions of things like ChatGPT in covered jurisdictions typically include such agreements and are thus GDPR compliant. So if you're using a licensed enterprise version, you're probably fine. If you're not in Europe or California, there may not be legal restrictions around data sharing. If you're in Europe or California and using free ChatGPT, you might be in trouble.
Finorix079@reddit
The mental model people have is "I'm asking a tool a question," not "I'm sending data to a third party." Those are very different in how a brain processes risk. Posters won't fix it because posters address policy, not the model. What works is making the safe path easier than the unsafe path, like an enterprise tier that auto-redacts before the prompt leaves the browser, or a corporate ChatGPT instance that defaults to no-training. Friction-on-the-unsafe-path is policy theater. Friction-removed-from-the-safe-path actually changes behavior.
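That "auto-redacts before the prompt leaves the browser" idea can be as small as a pattern scrub on the client side. A rough sketch in Python (the patterns below are illustrative placeholders, nowhere near a real DLP ruleset):

```python
import re

# Hypothetical pre-send scrubber: strip obvious identifiers from a prompt
# before it is ever transmitted to a chatbot API.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "PHONE": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
}

def redact(prompt: str) -> str:
    """Replace each match with a placeholder token like [EMAIL]."""
    for label, pattern in PATTERNS.items():
        prompt = pattern.sub(f"[{label}]", prompt)
    return prompt

print(redact("Ping jane.doe@client.com re: SSN 123-45-6789, cell 555-123-4567"))
# -> Ping [EMAIL] re: SSN [SSN], cell [PHONE]
```

Real DLP (Purview-style) does this with trained classifiers rather than regexes, but the principle is the same: make the safe path automatic instead of relying on posters.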
dumblebees@reddit
it’s not your job to train people. your job is to cover your ass and take home a paycheque that is as large as possible for as long as possible.
cjcox4@reddit
All AI was created from stealing end user data. All of it. Something to remember when your favorite AI model says they "won't use" the input data you send for "training". Do not trust these folks.
Nagroth@reddit
Enterprise versions have legally binding agreements. If they don't uphold their end there are civil and in some cases criminal ramifications.
Using any version that isn't signed off by the Legal department can get you into the same sort of trouble as posting it on your public social media page. So ya, don't do that.
Aggravating_Refuse89@reddit
One problem is this talk is all theoretical. You post stuff on Facebook, it's easily seen, but what is the worst-case scenario of someone putting sensitive data into an AI?
They could have a breach? Yes so could anything.
If this threat is as bad as many say it is, and I am not convinced, what is an example and how would it look?
Most users could not give two rat turds about the company. But they care very much about getting caught for real. Assuming the org doesn't have all the whiz-bang monitoring shit or anyone to look at it, let's say a dummy puts customer info into ChatGPT. How does that play out in a way that they get caught? Breach? Suit?
To convince end users, I think you'd have to show it's a reasonable risk that someone could get the data they put into ChatGPT, prove it was them that entered it, and that their boss could get that info from the leak itself, not from internal monitoring.
"It's theoretically bad" is not gonna stop end users. Saying "if you put that in there and something bad happens, it's going to clearly come back to you", that scares end users.
I have yet to see one person do this.
If I go and put Joe Blow's private info into ChatGPT. How does this go bad and how does it come back to me? Assuming my company does not monitor for this and many people in my org have access to Joe Blow's data?
Real-world stuff that makes sense to users. Cause I don't even have that for myself.
I would not do it because I do not trust these companies. But this feels very theoretical and not very relevant to the user.
Nagroth@reddit
It's very simple. Don't put company data anywhere that you haven't been told it's ok, in writing.
Aggravating_Refuse89@reddit
Average user also does not care about the blame shifting over to legal which is really all Enterprise agreements get you.
illhaveubent@reddit
There's also civil and criminal law against stealing intellectual property, but every model has been trained on it anyway without permission or compensation. I don't think the law is really being enforced on these companies because the government sees AI as a national security interest.
Nagroth@reddit
If Corporate Legal says I can give it company info then idgaf about any of that, let the lawyers eat each other.
illhaveubent@reddit
And that's fine, but let's not pretend there are civil or criminal ramifications for misuse of the data.
Nagroth@reddit
Why would you pretend that there's not?
illhaveubent@reddit
You're just going to ignore what I said in the previous comment and play dumb?
cjcox4@reddit
Legally binding only matters if caught and one has compelling evidence of violation. Agreed?
I've worked for fortune 100 companies that have done far worse things.
Careful-Criticism645@reddit
If there's no compelling evidence of a violation, then what's the issue?
Jimthepirate@reddit
For me it is funny how with AI people suddenly are so aware of their data, but then do not think twice about using cloud providers to store emails and all their files. I had people with a serious face tell me how concerned they are about MS Copilot, but then have all their email run on Exchange Online and SharePoint storing 99% of their documents.
Yes, there is good reason to be concerned, but at the end of the day, unless you run your own thing, you are putting trust in someone. You just need to do due diligence when choosing your stack and evaluate your vendors. For free stuff you are always the product. Did you know that, for example, with Gemini a human reviewer can look at your prompt and output "to ensure better quality"? I find "someone peeping at your conversations" is a way more effective argument to get people to listen.
Aggravating_Refuse89@reddit
Google is the absolute worst for prying eyes.
cjcox4@reddit
I think it's also funny that "giving up your privates" is defined as "modern day zero trust". Ironic.
SquareWheel@reddit
Public websites are not end user data, and scraping websites is not illegal.
PatHeist@reddit
Scraping anything copyrighted from a website is in fact illegal.
SquareWheel@reddit
It's certainly legal under US law, even if the website wishes to prevent it (see hiQ Labs v. LinkedIn). Even using that data commercially can be legal under Fair Use if deemed transformative (Bartz v. Anthropic, Kadrey v. Meta).
Do you have any specific counterexamples?
PatHeist@reddit
hiQ Labs v. LinkedIn resulted in a settlement after the court concluded hiQ Labs breached LinkedIn ToS.
Both Bartz v. Anthropic and Kadrey v. Meta rulings specifically state they're not an endorsement of the potentially illegal acquisition methods of the copyrighted material, only that the use in training is transformative.
SquareWheel@reddit
Yes, but neither of those points change the precedents that were established. Scraping was found to be legal, and training AI models was found to be transformative. In the words of Judge William Alsup, it is "exceedingly transformative". The fact that Anthropic pirated content was a separate matter entirely, and had no bearing on that determination.
You made the claim that scraping copyrighted material is illegal, but have not yet shown that to be true.
PatHeist@reddit
Copyright law makes copying copyrighted material without permission from the rights holder illegal outside of situations that can be classified as fair use. In the US, fair use has been consistently established as a case-specific matter that cannot be ruled on generally, and needs to be tested separately for each case based on the facts of that case. Both of the summary judgements in those two cases re-affirm this.
And both summary judgements specifically do not rule on the illegal acquisition of copyrighted material. Kadrey et al v. Meta Platforms, Inc. is currently an ongoing class action and Bartz v. Anthropic resulted in a $1.5 billion settlement over the piracy claims.
https://copyrightalliance.org/wp-content/uploads/2025/06/Bartz-v.-Anthropic-Order.pdf
https://law.justia.com/cases/federal/district-courts/california/candce/3:2023cv03417/415175/598/
Please find the part of these rulings that sets a carte blanche precedent that scraping copyrighted material is legal.
SquareWheel@reddit
This is a reversal of how the law is structured. You don't need to find legal exceptions to determine if something is allowed. Acts are permitted by default unless determined to be illegal. The burden of proof rests with those claiming that something is illegal.
In this case, however, I'm not arguing that a carte blanche exists. Scraping has been shown to be legal under more specific scenarios such as accessing public information, but not when bypassing captchas, login pages, or other technical barriers. That access is enough for AI training, and does not violate the CFAA.
You claimed that general copyright law protects published data, however simply downloading (public) website data has not been shown to be a copyright violation. It's how that data is used is the deciding factor.
We've already gone over case law, but there's many other precedents which show legal use of web content under fair use. Examples are web indexes such as Google Search, news agencies quoting tweets or reddit comments, YouTube remixes, and reverse image search tools.
That's true. Fair use is a defense, not a general legal protection. In cases where it's disputed, it may go to trial. That's what we're seeing now with a number of AI cases. However, so far training these AI models has been found to be highly transformative, and thus good candidates for fair use exceptions. Once case law is better established, a stronger precedent will be set.
Again, the issue of Anthropic and Meta utilizing piracy is irrelevant to the question of training. Piracy is clearly illegal, and they should be and are being fined for it. But the original question was over public web scraping, and as discussed here, there is strong precedent for scraping being legal.
PatHeist@reddit
Copyright infringement is explicitly illegal in and of itself. There are exceptions to what is generally illegal if you can demonstrate fair use. Fair use being an affirmative defense matters. You are right to say that acts are legal unless they are illegal, but in this case you're doing the equivalent of arguing that murder is legal because killing in self-defense can be ruled to have been justified.
You say there has been strong precedent for scraping being legal. Cases I have heard of affirm that scraping of copyrighted material in general is not legal. The judgements in the cases you've linked outline at multiple points that copying copyrighted materials to make a database of them without a specific purpose is in and of itself obviously illegal, and establish at length that their rulings on later using those databases to train AI are a separate question from acquiring the training material. You appear to be conflating the training rulings, which are explicitly divorced from the data-acquisition question, in a way where they comment on data acquisition. I struggle to see how this makes sense. In other cases, companies like Google have repeatedly lost regarding scraping and retaining copyrighted material where they could not establish a fair use defense, and they have had to modify practices as a result.
I maintain that if you want to claim that "scraping", which I understand to mean "downloading and retaining from websites", of copyrighted material is generally legal, contrary to copyright law, you should be the one to produce cases that establish that precedent. It seems like you think the cases above do, but obviously I am not reading the judgements the same way you are. If I am missing the part where the judgements establish that scraping copyrighted material is legal, please point it out. Or support your argument with one of the other cases you say establish this precedent.
SquareWheel@reddit
It seems at the very least that we agree that scraping under fair use is permitted. There's disagreement on if scraping for general use - even if it may not be used in infringing ways - is also permitted.
You've referenced specific lines in the judgment that I would need to spend time reviewing -- more time unfortunately than I have right now. I would however be willing to review and evaluate shortly. If I've made a mistake, I'd like to correct it, and would recognize the error.
To bring this discussion back to its starting point though, since we agree that scraping under fair use is accepted within the US legal framework, and that training AI models specifically qualifies as fair use, it follows that training models on public website data is permitted. Or to be more precise, would be highly defensible in a suit. This may change depending on the outcome of future court cases, but existing precedent does point in this direction.
The specifics of the generalized case for scraping is something I'll need to review more of in my free time. I agree with the definition you gave of the term.
postbox134@reddit
A hot legal topic on 'fair use'
cjcox4@reddit
Technically, your "bank" (for example) is a public website.
SquareWheel@reddit
Scraping public pages on a bank's website is completely okay, such as seeing what types of accounts they offer. Scraping anything behind a login page would likely be a violation of the Computer Fraud and Abuse Act.
cjcox4@reddit
Their reach is more egregious than your "okay" makes it seem.
Superb_Raccoon@reddit
Nobody wanted the IBM models that had clean training data. Any of the Granite models are copyright-clean.
AriesCent@reddit
Pii!!
Jacmac_@reddit
If a user puts client information into a google search prompt, full names, deal sizes, internal pricing strategy, etc., is that data sharing?
What's happening here is that admins, by which I mean the AI wannabe police, have decided that data that goes in, which might later be used to train models, will somehow be used nefariously by the model owners, or that third parties will later somehow co-opt ChatGPT to leak this information directly to them because it was trained on it. This simply isn't the case. It is hyperbolic to think that a user putting data into their context is going to have the data leak out to the rest of the world. If they save the context and someone else logs into their ChatGPT account, well, then I guess you could look at that. But the mere fact that they pasted information into a ChatGPT prompt does not mean anyone else in the world gets to take that raw data as if it was freely shared.
Aggravating_Refuse89@reddit
A lot of this is a reaction to a theoretical problem that could happen. I think that is why it doesn't get taken seriously.
CPAtech@reddit
The issue at hand is that you are transmitting sensitive data to a third party as soon as you enter it into a prompt.
Jacmac_@reddit
You could make that case about searching. Seriously, the threat is being blown way out of proportion. There is more danger of the user copying the information to an unsecured USB device and losing it than sending it to ChatGPT.
CPAtech@reddit
Are you inputting PII into search engines? That actually is the same case.
Jacmac_@reddit
OK so how are you stopping users from putting PII into search engines and what is the known exploited risk?
phunky_1@reddit
It's your job to give people AI tools that they can use safely.
Convince decision makers that this is a risk, have them get staff a paid subscription to ChatGPT, Claude, etc.
Then it's a non issue.
nyckidryan@reddit
Since when is it my job to give people access to AI tools? Should I give them access to Tor as well?
phunky_1@reddit
I guess if you just blindly run IT without suggesting tools that can improve business, knock yourself out.
People are going to do it regardless, you might as well make it safe for your data.
Aggravating_Refuse89@reddit
Safe is the wrong word. Make it someone else's responsibility, like Uncle Micro$oft.
Linkpharm2@reddit
A GPU with Qwen3.6 27b and vLLM will help a lot to solve this problem.
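For anyone curious what "rolling it yourself" looks like, a minimal sketch (the model ID and flags are illustrative; check vLLM's docs and your VRAM budget before copying anything):

```shell
# Serve an open-weight model behind an OpenAI-compatible API on your own GPU.
# The model ID below is illustrative; pick one that actually fits your hardware.
pip install vllm
vllm serve Qwen/Qwen2.5-32B-Instruct \
    --max-model-len 32768 \
    --gpu-memory-utilization 0.90
# Any OpenAI SDK can then point at http://localhost:8000/v1,
# and prompts stay on your network instead of going to a third party.
```

Whether that beats a $100/month subscription depends entirely on how much context and throughput you actually need.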
kylethedesigner@reddit
Absolutely, and honestly a lot of office tasks don’t even need AI when a simple python script would accomplish the same thing faster and for free.
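As a sketch of that point: the kind of chore people paste into a chatbot, say drafting renewal notices from a spreadsheet export, is a few lines of stdlib Python, with the data never leaving the machine (the column names and template here are made up):

```python
import csv
import io
from string import Template

# Hypothetical mail-merge chore, done locally instead of in a chatbot prompt.
TEMPLATE = Template("Hi $name, your renewal of $amount is due on $date.")

def draft_emails(csv_text: str) -> list[str]:
    """Return one drafted email body per CSV row."""
    rows = csv.DictReader(io.StringIO(csv_text))
    return [TEMPLATE.substitute(row) for row in rows]

sample = "name,amount,date\nAcme Corp,$12000,2025-07-01\n"
print(draft_emails(sample)[0])
# -> Hi Acme Corp, your renewal of $12000 is due on 2025-07-01.
```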
Aggravating_Refuse89@reddit
Many would need AI to write that script.
Snarky response
03263@reddit
It's probably in a million different data leaks already.
Everyone and their mother has potential access to my full name, home address, phone numbers, SSN, insurance info, email, leaked passwords, etc. I've cashed enough data breach settlement checks to be sure of it.
So pardon me if I don't really care about it continuing to happen. Just keep sending those checks.
Aggravating_Refuse89@reddit
This is the attitude I am talking about.
OldGeekWeirdo@reddit
Can you go to ChatGPT at your computer and using a different account, ask questions about company deals?
If it shows it knows about it and will tell strangers, it will land much harder.
wazza_the_rockdog@reddit
It's not an immediate thing and may not allow a direct lookup due to guardrails it sets up, more of the concern is that someone looking for similar info in the future may be given that live data as an example.
Aggravating_Refuse89@reddit
This is the problem. It's not immediate and very much may not tie back to the actual user. Employees don't care about the company. At all. They care about their ass. If that is not directly threatened, they are not going to care about the rest. If I could go search for data and find that Jeff entered it, Jeff would be scared. But I have not heard of any situation where it comes to that.
cpz_77@reddit
I was about to say something like “you guys have training and policy posters for AI stuff?” till I realized you probably mean security/data sharing training. And yeah unfortunately as you mentioned most users probably aren’t going to make that connection, at least not the less technical ones. We’ve had some users that had asked questions indicating they do understand it (generally more technical “power users”) but others have said or done things that makes it very clear they don’t.
We desperately need some actual "AI best practices" training for all users I think (specific to AI), but part of the problem is I don't think we (IT) even really fully know what those are yet. Obviously there are security standards we can and do already train users on, but taken outside of the context of the IT world, many don't know how to apply that to other situations like AI. Plus there's the whole topic of productivity and "moving fast" ( 🙄 ) vs. maintaining process and security. Every company/entity has to decide for themselves where they want to set that line.
But I don’t think any of this is really a thing yet, anywhere. Everybody and their brother is balls deep in this AI craze but I haven’t heard anyone say their company is spending cycles actively trying to draw up an official AI “best practices” or “usage guide” and/or put together some sort of AI training for users at their company. Doesn’t mean it isn’t happening of course - maybe some places somewhere are doing that - but I certainly haven’t seen it anywhere (least of all at my own place 🤣 ).
mysysadminalt@reddit
Meanwhile we had to turn off TLS inspection for these platforms so DLP no longer works 🙃
Aggravating_Refuse89@reddit
This is some reality here. If you do not have that, it really doesn't matter what else you do have.
therankin@reddit
Oh crap! You just reminded me that there's a problem with my TLS cert and I forgot to make a reminder about it!
Thank you for inadvertently reminding me!
NapalmNorm@reddit
I had a director on our AI committee who helped review and approve AI policy for the Engineering department. He started using a free Claude account (not an approved platform at the time) on his personal computer, entering confidential company data (whole separate conversation), to generate side-by-side samples showing how much better it is than Copilot. He put it all into a presentation to show me and our COO, and all I could say was… what the fuck.
Aggravating_Refuse89@reddit
That was stupid. He definitely should not have done that and, more stupidly, shown it to you and the COO. Idiots are everywhere.
Slivvys@reddit
Switch to copilot for enterprise, turn on enterprise data protection to prevent it from being used to train foundational models.
MeatPiston@reddit
This is the correct thing to do. Unfortunately Cope Pilot is useless.
Slivvys@reddit
Usefulness is subjective to what you need it to do. For most businesses using it for general admin and sales, it's more than capable.
Using it for code? High level anything? Not without enabling Claude or chatgpt plug-ins for it.
Aggravating_Refuse89@reddit
This is exactly how Microsoft stays in business. Corporations want a minimum viable product. This is also why the "AI is going to replace you" hype is BS: not as long as it's limited, for most, to being only on par with Teams, Sharepoint, Bing, Zune, and Windows 11. Copilot is a Microsoft turd version of something both useful and dangerous. The entrepreneur types want innovation but lawyers don't. AI is going to be at best the next Google and some automation.
MrHaxx1@reddit
I genuinely don't understand why people think Copilot is useless.
It's great for anything in M365, and for everything else, it does the job. I kind of hate how much of a pain in the ass it is to use connectors, whereas it's two clicks in Claude, but otherwise it's fine.
Cyhawk@reddit
Copilot can use the majority of models.
MagicWishMonkey@reddit
An enterprise agreement with any of the big AI companies will prevent your data from being stored or used to train anything. There's no need to "switch" to anything.
I'll bet anything the OP has no idea what agreement/contract is in place but has strong opinions anyway.
Slivvys@reddit
The difference here is the risk of shadow IT. Most likely their org already uses M365 with a hybrid joined DC, so computers can default to being logged into Copilot. If you reduce the hurdles your staff have to go through to access a tool, they'll be more likely to follow that route instead of buying their own tool that requires additional steps to access, and that they may lose access to should the organization decide to block it down the road.
I prefer Claude myself, but I also understand when my users are left to their own devices they make insane choices.
wavemelon@reddit
I had exactly the same conversation with the founder of our company the other day; he wanted to upload a client's list of their clients' contact details and calls to it to output a contact trend analysis…. One day, on the plus side, if you ever forget your phone number, address and social security number you can just ask ChatGPT, haha
spittlbm@reddit
Mythos to the rescue!
Aggravating_Refuse89@reddit
This is a lot more of a risk than anything an LLM could possibly ever imagine.
spittlbm@reddit
If the rumors are true, golly. Mozilla alone already used it to identify and patch 271 vulns.
shangheigh@reddit (OP)
It's all fun until someone else asks your AI about your card number, social security and address.
Aggravating_Refuse89@reddit
And then exactly what? Some company has that info. Yet we trust other companies and people we don't know with that info every day. While it's possible, the actual risk there is pretty low.
Express-Pack-6736@reddit
Most companies are sitting on way more than 23 and just don't know it. The real number is probably way higher if you count the stuff people use on their phones and personal laptops. The visibility gap is massive and it's only gonna get worse as AI tools multiply. We recently onboarded LayerX and it painted a much worse picture than we originally thought.
Aggravating_Refuse89@reddit
What's LayerX?
itskdog@reddit
Just for GDPR purposes I'd love to roll out Teams to replace the numerous WhatsApp group chats staff have set up, but there'd be no buy-in from school leadership.
Rajvagli@reddit
It's called PII, and your sales person is an ass butt.
https://csrc.nist.gov/glossary/term/personally_identifiable_information
Aggravating_Refuse89@reddit
Ass butt?
Hello Castiel
UpperAd5715@reddit
Just share this with HR or whoever does infosec and wash your hands of it; you don't want to bite off more than you can chew on this one.
Working in a regulated industry our users are pretty good with all of this stuff but there is also very good retention so we don't get a boatload of people that don't care. A business analyst scrambled an ancient dataset a few times and people still threw a hissy fit because the name of a higher level employee appeared in it while all data was fictional, just in case someone might consider it to be real.
Aggravating_Refuse89@reddit
This is the real answer. Everyone drinking the microsoft generic grape beverage and looking to solve a people issue with tech.
Sobeman@reddit
she is head of sales, sales people are immune to any disciplinary actions
randalzy@reddit
"but I'm Pagliacci"
DaftOnecommaThe@reddit
100% this, if there is a compliance/governance officer they need to be made aware.
leogodin217@reddit
This used to really worry me until I realized there are all kinds of tools we put client information in. Salesforce, mail provider, etc. Is the risk any greater with an LLM? Any one of those companies could break the contract and train AI on the data.
dllhell79@reddit
Yes, that is a valid mindset from the perspective of an IT professional. It's really not our responsibility to micro manage and govern every single action that the end user takes. The question I'd also pose though is what happens when you get audited, and an auditor discovers this is going on with no safeguards or solutions in place to prevent such behavior in the first place? That is where things get into grey territory, especially since many of the AI governance tools do not actually exist yet.
leogodin217@reddit
Governance is a real concern right now. My question is what is different between using Gmail (or whichever provider the company uses) to send an email and using AI to help draft it? Both cases are sending customer data to an external system.
therankin@reddit
Yea, that's basically where my mind is now too.
Turbulent-Pea-8826@reddit
I imagine this is why AI is huge right now. All of these companies have a direct link to your company data. Even if they aren’t training on it they are just sucking up all of this useful information that they can use for their own benefit.
jgrig2@reddit
It depends : was she using a free version or an approved enterprise version?
Curious201@reddit
this is exactly the kind of thing companies need a simple internal rule for, because “can i paste this into chatgpt” should not be decided by each employee in the moment. client names, addresses, private order details, contracts, support tickets, source code, credentials, screenshots with customer data, all of that should be treated as not safe for public tools unless the company has an approved setup and a written policy. i also think training matters because a lot of non-technical staff do not think of an address or spreadsheet row as sensitive, they just see it as text they need help rewriting. the safest practical rule is to anonymize first, use fake names and dummy numbers, and only paste the structure of the problem, not the actual client data.
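That anonymize-first rule can be sketched as a tiny pre-flight redactor. This is a toy illustration, not a product: the patterns and placeholder labels here are assumptions, and real PII detection needs far broader coverage.

```python
import re

# Illustrative patterns only; real detection needs many more, plus context.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "PHONE": re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"),
}

def redact(text: str) -> str:
    """Replace recognizable PII with labeled placeholders before pasting."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text

print(redact("Reach Jane at jane@acme.com or 555-867-5309, SSN 123-45-6789"))
```

The structure of the request survives ("reach [EMAIL] or [PHONE]"), which is usually all the chatbot needs to help with wording.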
elementsxy@reddit
Has she been fired yet?
Dry_Inspection_4583@reddit
The average user doesn't know where the information is; they postulate it's "in the AI". I wonder if maybe your expectations need to be lowered.
caylyn953@reddit
D'oh! This is why you must sign up to the enterprise version (you have done that, right? Otherwise this is 100% all your fault)
Then it won't really matter so much at all if people are doing this or not
SlickAstley_@reddit
Surely you'd have to play dumb in that scenario tho.
I know its wrong and sometimes still do it anyway
BroaxXx@reddit
Is she using a public api? If so this should be blocked by default.
Aggravating_Refuse89@reddit
Going to go against the grain and say: it's bad. It's probably against policy and could get someone fired, depending on said policy and whether they're trying to get rid of someone. Don't put real info in AI.
Their bigger risk is that the IT department will be tasked with documenting it and it will be used as a way to fire someone.
The actual real risk of your data leaking out of an AI is not zero, but its not likely. I will say that super proprietary info or classified info is a bad idea. But most likely unless there is a breach of the AI and it can be tied back to coming from you, the risk is pretty low.
A lot of orgs use DLP to catch this
Now in person I would totally say the opposite because the security dogma states you will believe it. Just like I believe a lot of "security" is placebo box ticking. But to say any of that is blasphemy if you work in IT.
So bottom line: you are in IT. You caught someone. The risk is low. But what does the policy say you do about it?
Its astronomically low chance this will cause a real problem but its a definite no no in security dogma.
Scullyx@reddit
Give employees a 'Do Everything' button
Employees push 'Do Everything' button
Im shock
dllhell79@reddit
You should have called it "disclosing proprietary company information and trade secrets" instead of "data sharing". Sure, she may not be doing that fully, but it sounds more forceful and impactful. It sounds like a negative thing that could have consequences. The term "data sharing" almost implies that it's a good thing.
PotatoOfDestiny@reddit
People type some wild-ass shit into chatGPT too, all of which is potentially discoverable in a legal proceeding. Never type anything into a chatbot that you wouldn't want read back to you in court.
Deltrus7@reddit
I work in Healthcare.
Don't put anything in AI that you don't think you should. We keep all patient information out of it.
CernerBurner2000@reddit
I think that within the next 5 years we are going to see our first case of "extortionware", where a bad actor gets hold of enough company-specific PII and forces them to pay or else it will be released to a competitor or the public.
My wife can tell me one time that our dishwasher sucks and we need a new one, and my ads on Facebook for the next two weeks are nothing but dishwashers. If Meta AI is listening to me while I'm at home then it's also listening while I'm at the workplace, and if they are listening, all the others are too.
w1na@reddit
There is an option in ChatGPT to turn off model training on your input. And sincerely, if you are part of IT and you do not advise your department about how to handle data in AI tools, that is your failure.
Secret_Account07@reddit
I mean do you have an enterprise license?
I don’t use ChatGPT but most AI enterprise licenses have data protection. I can drop full server info on Copilot just like I can with Sharepoint
MagicWishMonkey@reddit
Reading all these replies by people who A) have no idea what their actual contractual agreement is while B) assuming they are informed of the legal implications of using licensed software.
Not only is that 100% not your fucking job, but you're also not informed enough to have an opinion and you definitely shouldn't be telling users what they can and cannot do. These people are just looking for ways to get fired, it's wild.
postbox134@reddit
This is why ChatGPT enterprise exists.
If they demand the tools you've got to provide it or shadow IT happens. And no Copilot isn't good enough (although easier to deploy and manage), people want the native tool (ChatGPT or Claude).
bobo_1111@reddit
Lol Copilot IS ChatGPT and Claude. It’s just a front end and also is enterprise so data doesn’t leave your tenant or get used for training.
postbox134@reddit
It's significantly worse than the native tools. Yes, the wrapper makes it more controllable, but if it's useless then no user wants to use it.
_-pablo-_@reddit
Copilot is just the wrapper, they still get access to the models.
The Orgs that jumped onto ChatGPT enterprise are having a hell of a time integrating it with o365 and that suite
postbox134@reddit
Copilot is only better at O365, everything else is much worse imo
Finn_Storm@reddit
But... Copilot is shite for M365? Like, I asked it to build a simple query with 10 lines of code in Power Automate and it couldn't even do that. Gemini makes spelling mistakes and does funny stuff like hallucinating commands that don't exist, and ChatGPT just straight up lies half the time.
I've heard good things about Claude, but I've yet to try it out and I'm skeptical of it being as good as they promise
F0rkbombz@reddit
Claude does the same crap. I do a lot of work with KQL and Copilot and Claude will both confidently make up table names, properties, and produce some of the most inefficient and ineffective KQL I’ve ever seen.
I haven’t used ChatGPT in a while, but last time I did it was making the same mistakes on stuff.
All the LLM produce slop and will confidently lie about it to convince you it’s good.
Finn_Storm@reddit
ChatGPT can handle PowerShell okay-ish, I suppose. It will do some things inefficiently, but mostly it's logically explained and crafted. But that's also basically all I use it for, aside from its deep research feature.
K-Rose-ED@reddit
Claude is very good, but it used to be better. The problem with all these apps is your service quality can suddenly tank; all of them have been constantly changing models and limits, reducing how much they can do. It's a mess.
Finn_Storm@reddit
Can't you just return to the old models? /gen
nyokarose@reddit
Yeah… but who wants to pay dual licenses….?
MrHaxx1@reddit
Our company does that lol
Bogus1989@reddit
yeah, our gemini instance is hilarious. it just treats me like an end user sometimes and says i could be performing unauthorized things….
but with the change of one word, it spits the answer out. 🤣
Visible_Soup_5484@reddit
Copilot offers Claude Opus FWIW
postbox134@reddit
Model != features and UI/UX
jayybeegeee@reddit
Recommending chatGPT enterprise while ignoring that Copilot Enterprise is an odd take
007bane@reddit
This right here.
flummox1234@reddit
I'm willing to bet the same person would lose their shit if someone did that with their data. There might be a solution for you there, probably not.
EvilGreg13@reddit
When was the last time you were impressed with the level of intelligence of a sales person.
UpsetMarsupial@reddit
I remember a time when a sales person was impressed with my knowledge. This was when Maplin was still trading. I was buying individual components, and the sales person asked what I was making. I assumed (naively in retrospect) that he therefore was knowledgeable and therefore talked about it and asked him a question about an as yet unsolved problem I had. He just looked at me with a face of bewilderment and said something like "You know your stuff". An almost heartbreaking moment, as when I went there as a kid 35 years ago I'd get loads of support from the staff.
GermanAf@reddit
Imma just call the GDPR regulators and they will take care of it :)
mydogcaneatyourdog@reddit
When I see people comparing the use of blob storage and s3 to using private data in LLM prompts, it concerns me greatly. This thread makes me wonder how many people don't understand their tools or are just straight bad faith bot replies.
thatirishguyyyyy@reddit
I am almost certain this is violating contract agreements and you should probably tell someone higher up... or let your clients know.
mmmaaaatttt@reddit
I think about this. And then I usually can’t be fucked using placeholders so just share the data anyway.
VividVigor@reddit
I would stop calling it “data sharing” and call it data exfiltration or data breach. Get serious about the name of this behaviour because “sharing” sounds like a nice, collegial thing to do.
Send emails with phrases like exfiltration attempts detected. Data breach results in loss of revenue and brand reputation and directly impacts every employee. Yada yada.
FlyingBishop@reddit
ChatGPT is just an app. Uploading data to it is no different from putting it in Google Docs, if you have an enterprise agreement. Yes, they store it on their servers. Sure, you don't trust OpenAI but why do you trust Microsoft when you upload PII to O365? Why do you trust Google with Google Docs? There's no breach unless you can demonstrate the data was used by someone outside the company.
Bogus1989@reddit
exactly this.
or I say, do you actually want your employer to have dirt on you? easy pickings for layoffs.
doubleopinter@reddit
Oh I can’t tell you the things sales and marketing are doing in ChatGPT. People are fuckin stupid. We’ve caught ppl giving production credentials to chat gpt. I think we’re making it a fireable offence soon.
GX_EN@reddit
Prod creds? JFC.
I have a friend who has uploaded un-redacted personal medical info right into ChatGPT because she's a hypochondriac and constantly trying to diagnose her "problems" instead of you know, talking with her doctor..
RadlEonk@reddit
Send her an email, copy your Legal, Risk, Compliance department(s), and explain that she can help contact the clients and states attorneys general when the breach happens.
Sobeman@reddit
if you are allowing your org to access LLMs without guardrails then you have bigger problems then training
RIPGoblins2929@reddit
This would get your license yanked if you're an attorney.
Unlike sales people we have professional responsibility rules we are mandated to follow.
Varrianda@reddit
If it’s an enterprise version it’s fine, that’s the whole point. If y’all are just using ChatGPT.com though….
thepatientwaiting@reddit
I couldn't even use Grammarly at my last job because it was risking client confidentiality.
Competitive_Smoke948@reddit
That's definitely a GDPR issue, AND she's putting your proprietary data, sales, strategy etc. in a place competitors can find it... that's a firing offence... report her to HR.
Ok-Measurement-1575@reddit
It's far worse than that. People aren't actually stupid, they just don't care and who can blame them?
Fake jobs, fake economy, everything is busywork :D
DocterDum@reddit
I disagree with that entirely - We see plenty of execs and owners who are just as dumb despite having a high stake in the company. Users that are actively asking for help but struggle to understand basic concepts. I’m sure some people are just lazy, but the stupid runs far deeper.
Bogus1989@reddit
i think the best ones probably know and acknowledge they dont know, so they actually ask wtf is going on. I love these people. I am one of those people always, because if i know i dont know? its time for me to go learn.
Bogus1989@reddit
yeah, I even hate explaining it to people. It even annoys me.
Yuugian@reddit
"You see Bob, It's not that i'm lazy. It's that i just don't care" -Office Space
NetworkingNoob81@reddit
It's a problem of motivation, all right? Now if I work my ass off and Initech ships a few extra units, I don't see another dime; so where's the motivation?
pdp10@reddit
Later in the film, the protagonists prove that automation can make fractions of a dime very motivating. I think the lesson here is pretty obvious: why were they working so hard in the first place?
ency@reddit
That's pretty much how I feel. I work for money and these companies have shown me they have no loyalty. So I'm going to do the job in such a way that makes things easy for me. I don't give a damn about pasting in SOPs, client info, and pretty much anything else. That's the companies responsibility to sort out, not mine. They gave me an AI prompt box and told me to use it...OK...
Observer422@reddit
100%
Bogus1989@reddit
her attitude is WAY the fuck out of line. im actually really thankful for HIPAA. because of its existence nurses and medical personnel take that shit DEADASS serious, and they actually come to me worried if even by accident they might have done something wrong. A lot come for clarity, too… Im glad my org actually is doing things right. I sure wouldn't be quiet if they weren't. Id tell end users the truth if I stumbled upon real issues no one did anything about.
newbietronic@reddit
Sales is always doing shit like that. Know how there are laws against recording without consent? Almost all tech sales calls over the phone are recorded without consent. I worked in there. Sales does not give a shit.
mitharas@reddit
I was hella confused because I thought IP addresses are okay. Only later did "client address" turn out to mean customer details.
lolschrauber@reddit
Completely wrong approach.
Ask if she got consent from the people on that list for their data to be shared with third parties instead.
TheIntrovertedHuman@reddit
lmaoo we are so cooked
_30Harsh_@reddit
You are training the GPT good enough bro
stromm@reddit
It’s implicitly against PII best practice.
pacman6642@reddit
Why would you ask your head of sales this question. You know their answer
GreenWoodDragon@reddit
Entertainment value TBH. A bit like poking a wasp nest.
StinklePink@reddit
Ya can’t fix stupid
WolfAffectionatefk@reddit
How does this work?
Geminii27@reddit
Get it in writing or an email from her, for when it inevitably blows up and she blames you for 'not preventing it'.
Djimi365@reddit
That's why I'm starting to see companies properly locking down AI tools now. Can't even access the sites of tools which aren't specifically allowed in the AI policy.
2c0@reddit
Then you show them the policy and tell whoever is responsible for them. You provide the tools, and policy dictates the usage. After that, not your problem.
tejanaqkilica@reddit
People are stupid. Next
shdwbld@reddit
A $2000 Mac Studio with Ollama is perfectly capable of running models capable of doing stuff like this locally. Or AMD Ryzen AI.
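For anyone curious what that looks like in practice, here is a minimal sketch against Ollama's default local REST endpoint (`/api/generate` on port 11434). It assumes an Ollama server is already running on the machine, and the model name is just an example.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint

def build_request(prompt: str, model: str = "llama3") -> dict:
    """Assemble the JSON body Ollama's /api/generate expects."""
    return {"model": model, "prompt": prompt, "stream": False}

def ask_local(prompt: str, model: str = "llama3") -> str:
    """Send the prompt to the local model; nothing leaves the machine."""
    data = json.dumps(build_request(prompt, model)).encode()
    req = urllib.request.Request(OLLAMA_URL, data=data,
                                 headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

Same prompt-polishing workflow, but the client addresses and pricing sheets stay on local hardware.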
hobovalentine@reddit
Not much different than a public google search but yeah not a good practice to put customer data where you don’t control the data.
Do push for a corporate account though for sure
toasterdees@reddit
Isn't company info like emails and addresses and phone numbers publicly available? I don't see why you can't use those.
Cayayu@reddit
That is considered personal information. You shouldn’t be sharing it.
Professional_Rip103@reddit
Enterprise ChatGPT doesn't train on data, but that doesn't actually solve the core issue, which is that sensitive data is still leaving your network and hitting a third-party server. Compliance doesn't care if OpenAI trains on it or not. The data was exfiltrated either way. That's why we switched to a model where the detection happens at the browser before anything gets sent anywhere.
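The browser-side gating idea boils down to a simple check before the request leaves. A toy sketch, assuming you maintain your own term list and patterns; a real DLP gate would key off classification labels and far richer detectors, not a word list.

```python
import re

# Illustrative only: a couple of terms and one SSN-shaped pattern.
BLOCKLIST = ["internal pricing", "confidential"]
PATTERNS = [re.compile(r"\b\d{3}-\d{2}-\d{4}\b")]

def flags(prompt: str) -> list[str]:
    """Return reasons the prompt should be held back; empty if it looks clean."""
    hits = [term for term in BLOCKLIST if term in prompt.lower()]
    hits += [f"pattern:{p.pattern}" for p in PATTERNS if p.search(prompt)]
    return hits

def gate(prompt: str) -> str:
    """Block flagged prompts; pass clean ones through to the AI tool."""
    hits = flags(prompt)
    if hits:
        raise ValueError(f"blocked before send: {hits}")
    return prompt
```

The point is where the check runs: before the request body is built, so nothing sensitive transits the network at all.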
itskdog@reddit
The same happens when you use Outlook or SharePoint.
Morlark@reddit
I don't think you understand what exfiltration actually is. I'll give you a clue: data leaving the network is a necessary but not sufficient criterion for exfiltration.
Companies work with external partners in ways that involve them processing data on your behalf all the time. As long as you have appropriate contracts in place delimiting what they are permitted to do with that data (process it on your behalf for purposes that you specify), and what security they have in place (obviously), it's 100% compliant.
It is, by definition, not exfiltration if you give it to them willingly.
zbignew@reddit
On the other hand, you can get a BAA agreement with Anthropic and then compliance doesn’t care at all about HIPAA.
The whole point of HIPAA was to make sure customers are lightly inconvenienced so they think something is being done. And then they will assume that the notes from their pelvic exam aren’t being read by temp office staff at the medical billing company.
Empty_Allocution@reddit
Coming from an education perspective:
I'm lucky because we get to block all of this shit. Staff only have access to a ring-fenced Gemini.
However, few years back we absolutely had braindead shit going on, like proper sensitive data getting dumped into ChatGPT to create 'reports'. It was an absolute nightmare. GDPR is lumped with me too, so we had to take a step back and basically give everyone the third degree and tune our web filters to be harsher.
The big problem in education is that every app under the sun is adopting some form of AI as a selling point. Many kids and teachers go mad for these flashcard generators with LLMs built in. We have seen so many instances where you can just break the LLM out of its rules etc, and get it to write code, do your homework etc.
I don't think it is an issue that is going away. And I agree, I have found you can't policy your way out of it. People are gonna keep dumping all kinds of data into these platforms, and they'll do it at home if they can't do it at work.
wannito@reddit
Two things. First, at this point most businesses need to embrace AI and LLMs and pick a tool that's officially supported so, at least in the terms/MSA, they don't train their models on your data. Otherwise people are just going to circumvent policy. Second, yeah, our data and customer data is already vacuumed up. As long as you're not using some Chinese model/platform (and even then, who cares) it's moot at this point.
twhiting9275@reddit
I mean, she was correct
Potatus_Maximus@reddit
It’s terrifying to see how people see LLMs as helpers and can’t grasp how much data is getting cached and mined. Voluntary data exfil at its finest. Not sure if we should laugh or cry
Careful-Criticism645@reddit
Why would these companies mine the garbage that their users are uploading? It's pretty much worthless and it would open their models to being poisoned by malicious actors.
Sweaty_Marzipan4274@reddit
New initiative, large committee with all the stakeholders, long ass weekly meetings at the end of day to design new policy posters. Simple.
Player2Systems@reddit
You can tell who’s never had to sit through a data retention policy review 😂 Even if the vendor says they “delete” inputs, liability still follows the sender. I’ve had luck framing it as “treat it like a public pastebin unless legal says otherwise.”
povlhp@reddit
We are clear: people can use Copilot and Gemini with company credentials. Everything else is off-limits for GDPR reasons. You make all the data public and the next user might see it. For developers, I tell them API keys become public property. They have to rotate every time they use AI on the codebase.
We are working with Anthropic to get a data processor agreement and no data sharing.
madasfire@reddit
Artificial intelligence is no match for no intelligence
PaleoSpeedwagon@reddit
I think the CISO would appreciate an anonymous email.
Nonaveragemonkey@reddit
They're probably the one that showed them how to do it..
recourse7@reddit
I know a loan agent that pastes in full on loan documents. Fully filled out.
SirEDCaLot@reddit
Honestly I wish companies would have a basic tech literacy test for all new hires. It would solve so many problems.
tifu_tifu_1000@reddit
That look is a classic corporate defense mechanism—usually deployed when someone either doesn't know the answer or realizes they’ve been playing fast and loose with compliance.
Thunar13@reddit
Head of sales..
habitsofwaste@reddit
I have dealt with this a lot in security. It’s like they think it only counts with a file or something. Silly people!!
Leather-Arachnid-417@reddit
They will after the first GPT related breach
SpongeJake@reddit
Wow. Seems obvious now that everyone needs to learn about AI and the problems it creates.
I used to work in IT for an outfit here in Canada. We deliberately disabled CoPilot (or didn’t enable; not sure how it works) solely because we don’t want Canadian data showing up on the U.S. side of the border. We had clients and we treated their data carefully.
Goodlucklol_TC@reddit
I would have banned use of ChatGPT immediately following that conversation.
Julio_Ointment@reddit
These companies are HOPING for this policy and training failure.
Financial-Chemist360@reddit
20 years ago people didn't understand all of those apps that were taking your entire contact list and selling it. You seriously think people understand that all these models are being trained with their data and nothing is walled off anymore?
eejjkk@reddit
This is my biggest concern every time someone brings up AI at work. It's like company sanctioned DLP circumvention.
Manitcor@reddit
sensitive data is for local models only. everyone is going to have a gaming laptop now
LandoCalrissian1980@reddit
It's amazing what employees will do with other people's data, but if they were asked if they want their data in public systems they'd blow a gasket. I see it so much with small retail companies and credit cards/credit applications. "Just email me a picture of your SSN & DL so I can print it 100 times on printers all over the world."
HippyGeek@reddit
Point out the policy violations and associated financial risk to leadership. Once people start getting fired, maybe behavior will change.
Public_Fucking_Media@reddit
Do you have paid models for them to use that aren't trained on your data, by contract?
If not, isn't that kind of on you?
shangheigh@reddit (OP)
We have pitched that, but the higher-ups don't buy it
Public_Fucking_Media@reddit
Tell them to enjoy your shadow IT data leaks while also getting beaten by companies that know how to use AI properly.
FlipMyWigBaby@reddit
Even Reddit has these types of basic safeguards! For instance, if you type in your credit card number, its expiration date, and CVV number, the SNU AI automatically detects that and obfuscates it with asterisks.
For example, here is my VISA card
\*\*\*\*-\*\*\*\*-\*\*\*\*-\*\*\*\* exp \*\*/\*\* and CVV \*\*\*
Go ahead and try yours to see for yourself!
proigor1024@reddit
The marketing person expensing Perplexity on her personal card is such a specific kind of chaos. We found our design team using some AI image generator one person put on their personal cc and expensed as a software subscription; we only discovered it because LayerX flagged it. Leadership was more mad about the expense policy violation than the security risk. Priorities, man.
HeligKo@reddit
We have tools to block that outside our managed platforms.
Infamous_Horse@reddit
Yeah, this is the real problem: the gap between what users think counts as data and what actually is.
Sales people are the worst about it too, because they're trained to share information freely; it's literally their job. We had the same fight until we put LayerX in place to flag the content before it leaves the browser.
Legitimate_Put_1653@reddit
Laziness will overcome security consciousness every time.
tobascodagama@reddit
Jesus fucking Christ these fucking people.
StockMarketCasino@reddit
Atakama browser management can selectively block uploads and or downloads to all or a curated list of sites you specify.
Don't let them upload into an unauthorized AI or get to those sites.
dedjedi@reddit
You cannot reason a person out of a position they did not reason themselves into.