this post was submitted on 07 Jun 2024
1 points (100.0% liked)

Technology

59601 readers
2940 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

A user on the online forum 4chan has leaked a massive 270GB of data purportedly belonging to The New York Times. This leak includes what is claimed to be the source code for the newspaper’s digital operations.

top 50 comments
sorted by: hot top controversial new old
[–] DudeImMacGyver@sh.itjust.works 0 points 5 months ago (2 children)

Source code... for a website?

[–] Potatos_are_not_friends@lemmy.world 0 points 5 months ago (1 children)

Subscription software. Tracking software. Ad tools. Promotion tools. Tools for journalists.

The website is just what you see.

[–] DudeImMacGyver@sh.itjust.works 0 points 5 months ago (1 children)

Yeah, I guess I didn't consider all the other operational shit that goes into providing content and funding for the website.

[–] aStonedSanta@lemm.ee 0 points 5 months ago (2 children)

It’s why our PCs have gotten insanely fast but websites still load like fucking trash. All the back end spying shit takes up a ton of cpu cycles. If you don’t already have em run ublock origin and no script and the internet is so fucking speedy 😆

[–] DudeImMacGyver@sh.itjust.works 0 points 5 months ago (2 children)

I hadn't noticed but then again I run Ublock Origin on Firefox.

load more comments (2 replies)
load more comments (1 replies)
[–] MacNCheezus@lemmy.today 0 points 5 months ago* (last edited 5 months ago)

Anything more complicated than a static website is going to have a significant amount of server-side code.

Also, the article explains that it's not just the website, but ALL of their repos, which would include their smartphone apps, backend tools, etc.

[–] autonomoususer@lemmy.world 0 points 5 months ago* (last edited 5 months ago) (3 children)

We still have no legal right to use, change and share exact and modified copies of its source code, control it both ourselves and in groups. It's still anti-libre software, useless, dangerous.

[–] seathru@lemmy.sdf.org 0 points 5 months ago* (last edited 5 months ago) (2 children)

Anything that may help develop better adblockers/paywall bypasses or exposes how/what of our personal information is collected is a win in my book. And this may very well be none of those things.

[–] autonomoususer@lemmy.world 0 points 5 months ago* (last edited 5 months ago)

They only exist when we keep them relevant and we already know we can't prove it's private but if it helps some people, that's good.

[–] 0xD@infosec.pub 0 points 5 months ago (2 children)

Right, because fuck paying for proper journalism. Everything must be free!

Remind me again, how does that work?

[–] errer@lemmy.world 0 points 5 months ago

I pay for the NYT, and yet every other screen is a fucking ad (often the same ad repeated over and over). You already have my subscription money, and unless they decide not to be so greedy (haha), their ads get shoved up my pihole.

load more comments (1 replies)
[–] magi@lemmy.blahaj.zone 0 points 5 months ago

Very few care about licenses unless the use of such material can be proven, and good luck with that

Just seeing how something is approached helps.

I sometimes rebuild software from one language to another for practice.

[–] JoMiran@lemmy.ml 0 points 5 months ago (1 children)

I expect that paywall to be fully useless soon.

[–] Dark_Arc@social.packetloss.gg 0 points 5 months ago (2 children)

That's a really silly take ... a Paywall is just an authorization mechanism.

That's like saying the source code of lemmy leaks and you expect your account to be compromised any second.

[–] example@reddthat.com 0 points 5 months ago (2 children)

I can sell you a copy of lemmys source code, are you interested?

[–] DudeImMacGyver@sh.itjust.works 0 points 5 months ago

I'll sell it for cheaper!

[–] wabafee@lemmy.world 0 points 5 months ago

I can give you 25 schmeckles.

load more comments (1 replies)
[–] RagingHungryPanda@lemm.ee 0 points 5 months ago (5 children)
[–] General_Effort@lemmy.world 0 points 5 months ago (2 children)

270GB of mostly node modules?

[–] SpaceNoodle@lemmy.world 0 points 5 months ago

You're right, it would be bigger if it was node

[–] asdfasdfasdf@lemmy.world 0 points 5 months ago

Sounds pretty average

[–] wreckedcarzz@lemmy.world 0 points 5 months ago

"send nodes"

[–] SpaceNoodle@lemmy.world 0 points 5 months ago (2 children)
[–] autonomoususer@lemmy.world 0 points 5 months ago* (last edited 5 months ago) (1 children)

I also hate making things from smaller pieces, the engineering in software engineering. /s

load more comments (1 replies)
[–] CaptainSpaceman@lemmy.world 0 points 5 months ago (1 children)

Node has been around longer than web3

NPM nightmares intensify

[–] floquant@lemmy.dbzer0.com 0 points 5 months ago (4 children)
load more comments (4 replies)
load more comments (2 replies)
[–] lurch@sh.itjust.works 0 points 5 months ago (2 children)

reminds me of the time someone said "Who is this 4chan?" on tv and it became a meme. good times

[–] Isoprenoid@programming.dev 0 points 5 months ago (1 children)
[–] PipedLinkBot@feddit.rocks 0 points 5 months ago

Here is an alternative Piped link(s):

https://www.piped.video/watch?v=qz5i171h_no

Piped is a privacy-respecting open-source alternative frontend to YouTube.

I'm open-source; check me out at GitHub.

load more comments (1 replies)
[–] skymtf@pricefield.org 0 points 5 months ago (2 children)

I have not read the news in a really long time just cause paywalls are annoying as frick.

[–] Dark_Arc@social.packetloss.gg 0 points 5 months ago (1 children)

Consider paying for the news...?

[–] Serinus@lemmy.world 0 points 5 months ago (1 children)

I'd only do that if you want independent news.

[–] Dark_Arc@social.packetloss.gg 0 points 5 months ago (3 children)

I'm not sure what you're saying here ...

[–] PrivateNoob@sopuli.xyz 0 points 5 months ago (2 children)

He probably means one of these (or both):

  1. New York Times is a huge corporation. The commenter would only support a site which is run by one creator, or with a genuine small team, which is transparent and not an asshole.

  2. New York Times is biased politically or accepting bribery attempts from other corpos to make them look in a better light.

[–] TheBat@lemmy.world 0 points 5 months ago
  1. New York Times is a huge corporation. The commenter would only support a site which is run by one creator, or with a genuine small team, which is transparent and not an asshole.

Yeah but good luck chasing multiple stories across the world as a small team.

[–] Serinus@lemmy.world 0 points 5 months ago (5 children)

Jesus Christ, no. It's almost like you're trying to sow distrust in the news and facts.

The NYT isn't perfect, but it's some of the most reliable news the world has.

As of March 2023, The New York Times Company employs 5,800 individuals,[101] including 1,700 journalists according to deputy managing editor Sam Dolnick.[122] Journalists for The New York Times may not run for public office, provide financial support to political candidates or causes, endorse candidates, or demonstrate public support for causes or movements.[123] Journalists are subject to the guidelines established in "Ethical Journalism" and "Guidelines on Integrity".[124] According to the former, Times journalists must abstain from using sources with a personal relationship to them and must not accept reimbursements or inducements from individuals who may be written about in The New York Times, with exceptions for gifts of nominal value.[125] The latter requires attribution and exact quotations, though exceptions are made for linguistic anomalies. Staff writers are expected to ensure the veracity of all written claims, but may delegate researching obscure facts to the research desk.[126] In March 2021, the Times established a committee to avoid journalistic conflicts of interest with work written for The New York Times, following columnist David Brooks's resignation from the Aspen Institute for his undisclosed work on the initiative Weave.[127]

load more comments (5 replies)
[–] Serinus@lemmy.world 0 points 5 months ago (1 children)

Pay for news if you want it to be independent, and not beholden to sponsors.

I'd go as far as to say that paying for news (if you have the means to do so comfortably), is your duty as a commitment to democracy.

[–] Dark_Arc@social.packetloss.gg 0 points 5 months ago (1 children)

Ahh, yes I agree on all points; thanks for the clarification!

[–] Serinus@lemmy.world 0 points 5 months ago (2 children)

It's amazing the number of times on Lemmy that someone will come in with the completely opposite "explanation" for what I was saying. Almost like they have an agenda.

It's so weird to turn my statement of "support the news with money" into "the mainstream media can't be trusted".

Maybe it's only happened twice, but it's still weird that it's happened twice.

[–] PumaStoleMyBluff@lemmy.world 0 points 5 months ago

Phrasing things in semi-sarcastic converse will do that

[–] Dark_Arc@social.packetloss.gg 0 points 5 months ago* (last edited 5 months ago)

I was wondering if that's where you were going in part.

I think it's a bit of the phrasing; you stated an opinion that's vague to the point of tiptoeing towards the potentially loaded question: "who's independent media?"

It's not uncommon in the conservative media sphere to see a similar (typically series) of leading ambiguous questions. They're never genuine, it's always in the style of:

You know what the best operating system is? I'll tell you what the best operating system is, it's Linux. Do you know why Linux is the best operating system? It's because it's got penguins and penguins are great! Do you know why penguins are great? I mean, can you think of a more iconic bird? That's why, that is why ... and Big Microsoft is out to destroy your hopes and dreams aren't they? Yes, yes they absolutely are, with their soulless Windows operating system that's manufactured by the flying spaghetti monster. Now obviously folks, only use Linux if you support freedom not the unholy flying spaghetti monster. The flying spaghetti monster will destroy America. It's its one true mission. Support freedom, support penguins, stop the flying spaghetti monster.

I think it's made a bunch of if antsy lol

load more comments (1 replies)
load more comments (1 replies)
[–] Dark_Arc@social.packetloss.gg 0 points 5 months ago* (last edited 5 months ago) (1 children)

I doubt this will affect much ... that's a lot more source code than I'd expect though, dang.

Presumably a lot of it is for internal operations (custom editing software or something of that ilk).

[–] tal@lemmy.today 0 points 5 months ago

It sounds like it's not all source code, from the article.

[–] muntedcrocodile@lemm.ee 0 points 5 months ago (2 children)

Thats a lot of data but surly its not all their articles cos I'd very much like to train mixtral7x8b on it along with 4chan data and shir from the dark web. Surly there is a project where such a model is public and being trained on literally everything regardless of legality.

load more comments (2 replies)
[–] daddy32@lemmy.world 0 points 5 months ago

NY Times has a freaking great data visualisations, they are (were?) employing a wizard in this space, doing custom extensions on d3.js.

[–] jonne@infosec.pub 0 points 5 months ago (2 children)

Now everyone will get to run Wordle!

load more comments (2 replies)
[–] bloodfart@lemmy.ml 0 points 5 months ago

Critical support

[–] merthyr1831@lemmy.world 0 points 5 months ago* (last edited 5 months ago)

270GB feels insane for the source code of a single organisation. Is there media assets or backups in there too?

EDIT: yep, multiple subsidiaries and slack Comms which could inflate it by a lot. we post a whole lot of uncompressed shit on our slack

load more comments
view more: next ›