Continuation of `Having "blessed" 3rd party libraries may make D more popular` DIP thread
WraithGlade
wraithglade at protonmail.com
Sat Jul 5 14:24:09 UTC 2025
On Friday, 4 July 2025 at 22:29:13 UTC, monkyyy wrote:
> On Friday, 4 July 2025 at 21:23:07 UTC, WraithGlade wrote:
>> Tangentially, I seldom publish anything on GitHub because
>
> Publish everything, it doesnt matter no one actually looks
On the contrary, people *do* look at other people's source code
on GitHub, otherwise there would be no point in the site besides
distribution and there are better ways to do that.
Besides people though, there is also the problem of automated
plagiarism via "AI" bots, which is motivated by large companies
wanting the ability to steal the creative output and digital
labor of all of the rest of humanity and the whole internet in a
way that allows them to essentially launder the copyright of it
all (including both open source and proprietary code).
Microsoft's ownership of GitHub gives them more of a
superficially/ostensibly legal excuse to train based on the data,
even though there is no *ethical* grounds for doing so. In fact,
I've long suspected that that was actually one of the biggest
reasons they bought the site, if not the #1 reason in reality.
In fact, the risk could be spread even more broadly than that:
Any software that is under the control of any such tech company
could conceivably send any code processed by it (e.g. by a
compiler) to the company's database to feed their AI-powered
thievery. In fact, there have been trends of big tech companies
in recent years doing pretty much anything to get access to more
data to surreptitiously train their automated plagiarism systems
(LLM AI) on. That's why I won't ever fully trust anything made by
these companies, not even programming languages like C#
(Microsoft) or Go (Google). I even recall reading about Google
inserting code that tries to phone home into all executables
compiled by Go, though that was not related to AI if my memory
was correct. It's possible I may use C# or Go if it is the only
practical solution for something, but it is far from ideal from
the standpoint of someone who has learned these companies
increasingly cannot be trusted whatsoever.
Similarly, pretty much all cloud data storage service providers
have shown signs in recent years of changing their terms in ways
that grant them permission to train AI based on any and all of
your data (e.g. valuable idea documents, art assets, code, etc)
you have on their servers. Thus, if you write a text memo to
yourself that says "million dollar software idea" and store it on
Google Drive or Dropbox and someone subsequently asks ChatGPT or
similar for a million dollar idea then your idea may end up in
the results given to the user, but in a highly rearranged and
transformed way designed to make detecting and proving the
plagiarism difficult or impossible.
Nonetheless, thank you for the attempted reassurance and the
good-natured and wholesome spirit in which it was given, but I'm
not naive enough to trust these companies nor random people
browsing code on the internet.
Ideas are often very valuable, contrary to misguided but trending
beliefs in recent years that "there are so many ideas that there
is no danger in sharing them" and such. Human nature is what it
is. Wholesome people are abundant, but so are unethical people.
I only trust evidence and logic. Those are the only true
authorities on anything in this world that can ever exist.
Popular assumptions and trends aren't anywhere close to
trustworthy.
We good-natured people must learn to protect ourselves better and
to not be naive. I wish that your reassurances were sound, but
they aren't. Such a world would be nice though! I appreciate the
sentiment!
Putting all that aside: I hope everyone had a wonderful 4th of
July yesterday!
More information about the Digitalmars-d
mailing list