Welcome Guest ( Log In | Register )

Outline · [ Standard ] · Linear+

 Nearly 40% of web pages from 2013 are no longer ac, Maybe the internet doesn't last forever?

views
     
TShaya
post Jun 7 2024, 08:09 AM, updated 4d ago

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

Nearly 40% of web pages from 10 years ago are no longer accessible
Maybe the internet doesn't last forever?
By Christianna Silva on May 22, 2024

Every young person is undoubtedly given the advice: Be careful what you put on the internet because the internet is forever.

This advice is pretty good. Posting online can still have grave consequences, from getting suspended from school to losing your job. But life online might not be quite as eternal as we think.

According to new research from the Pew Research Center, 38 percent of web pages from 2013 are no longer accessible, and a quarter of all web pages that existed from 2013 to 2023 are no longer available. This trend is undoubtedly more aggressive for older content, which, I suppose, does make sense. For instance, just eight percent of pages that existed in 2023 are no longer available.

This phenomenon is called "digital decay," a sensation in which links to content across the internet, on government and news websites, on the "references" section of Wikipedia, and even X (then known as Twitter) no longer work. The 404 message is becoming all too common.

For instance, about a fifth of all tweets are no longer visible on the site a few months after being posted, either because the account went private, was suspended, or deleted. Tweets written in Turkish or Arabic were more likely to vanish than tweets written in other languages.

As the Columbia Journalism Review wrote, "The fragility of the Web poses an issue for any area of work or interest that relies on written records. Loss of reference material, negative SEO impacts, and malicious hijacking of valuable outlinks are among the adverse effects of a broken URL. More fundamentally, it leaves articles from decades past as shells of their former selves, cut off from their original sourcing and context. And the problem goes beyond journalism. In a 2014 study, for example, researchers (including some on this team) found that nearly half of all hyperlinks in Supreme Court opinions led to content that had either changed since its original publication or disappeared from the internet."

Link rot and digital decay can make some parts of the internet virtually unusable. Have you ever clicked on a news story and found that most of the tweets embedded in the post are blank, and the hyperlinks are no longer active? It's frustrating — and can hurt our ability to understand subjects and issues with context.

Source: https://mashable.com/article/internet-doesn...t-digital-decay
smallcrab
post Jun 7 2024, 08:11 AM

Getting Started
**
Junior Member
140 posts

Joined: Jul 2007
From: Puchong


Geocities how
TShaya
post Jun 7 2024, 08:17 AM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

QUOTE(smallcrab @ Jun 7 2024, 08:11 AM)
Geocities how
*
https://archive.org/web/geocities.php

https://geocities.restorativland.org/

https://www.oocities.org

(The irony that some of the Geocities archives like http://www.geocities.ws/ have died is not lost on me)
wailam
post Jun 7 2024, 08:33 AM

Apa benda ini?
*****
Junior Member
976 posts

Joined: Nov 2014
From: Sandakan, Sabah


server dead.
not pay web service.
etc etc.
MegaCanonF
post Jun 7 2024, 08:35 AM

Enthusiast
*****
Junior Member
866 posts

Joined: Jan 2018
inb4 cloud how?
MRaef
post Jun 7 2024, 08:36 AM

Casual
***
Junior Member
382 posts

Joined: Nov 2005

While it is unfortunate, it was a blessing of disguise for some people in the present day.

Any dumb, idiotic comments people made in the past are lost forever.
TShaya
post Jun 7 2024, 08:36 AM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

As China’s Internet disappears, ‘we lose parts of our collective memory’
By Li Yuan
Wednesday, 05 Jun 2024 10:00 AM MYT

Chinese people know their country’s Internet is different. There is no Google, YouTube, Facebook or Twitter. They use euphemisms online to communicate the things they are not supposed to mention. When their posts and accounts are censored, they accept it with resignation.

They live in a parallel online universe. They know it and even joke about it.

Now they are discovering that, beneath a facade bustling with short videos, livestreaming and ecommerce, their Internet – and collective online memory – is disappearing in chunks.

A post on WeChat on May 22 that was widely shared reported that nearly all information posted on Chinese news portals, blogs, forums and social media sites between 1995 and 2005 was no longer available.

“The Chinese Internet is collapsing at an accelerating pace,” the headline said. Predictably, the post itself was soon censored.

“We used to believe that the Internet had a memory,” He Jiayan, a blogger who writes about successful businesspeople, wrote in the post. “But we didn’t realise that this memory is like that of a goldfish.”

It’s impossible to determine exactly how much and what content has disappeared. But I did a test. I used China’s top search engine, Baidu, to look up some of the examples cited in He’s post, focusing on about the same time frame between the mid-1990s and mid-2000s.

I started with Alibaba’s Jack Ma and Tencent’s Pony Ma, two of China’s most successful Internet entrepreneurs, both of whom He had searched for. I also searched for Liu Chuanzhi, known as the godfather of Chinese entrepreneurs, who made headlines when his company, Lenovo, acquired IBM’s personal computer business in 2005.

I looked, too, for results for China’s top leader, Xi Jinping, who during the period was the governor of two big provinces. Search results of senior Chinese leaders are always closely controlled. I wanted to see what people could find if they were curious about what Xi was like before he became a national leader.

I got no results when I searched for Ma Yun, which is Jack Ma’s name in Chinese. I found three entries for Ma Huateng, which is Pony Ma’s name. A search for Liu Chuanzhi turned up seven entries.

There were zero results for Xi.

Then I searched for one of the most consequential tragedies in China in the past few decades: the Great Sichuan earthquake on May 12, 2008, which killed over 69,000 people. It happened during a brief period when Chinese journalists had more freedom than the Communist Party would usually allow, and they produced a lot of high-quality journalism.

When I narrowed the time frame to May 12, 2008, to May 12, 2009, Baidu came up with nine pages of search results, most of which consisted of articles on the websites of the central government or the state broadcaster Central Central Television. One caveat: If you know the names of the journalists and their organizations, you can find more.

Each results page had about 10 headlines. My search found what had to have been a small fraction of the coverage at that time, much of which was published on the sites of newspapers and magazines that sent journalists to the epicenter of the earthquake. I didn’t find any of the outstanding news coverage or outpouring of online grief that I remembered.

In addition to disappearing content, there’s a broader problem: China’s Internet is shrinking. There were 3.9 million websites in China in 2023, down more than one-third from 5.3 million in 2017, according to the country’s Internet regulator.

China has one billion Internet users, or nearly one-fifth of the world’s online population. Yet the number of websites using Chinese language make up only 1.3% of the global total, down from 4.3% in 2013 – a 70% plunge over a decade, according to Web Technology Surveys, which tracks online use of top content languages.

The number of Chinese language websites is now only slightly higher than those in Indonesian and Vietnamese, and smaller than those in Polish and Persian. It’s half the number of Italian language sites and just over a quarter of those in Japanese.

One reason for the decline is that it is technically difficult and costly for websites to archive older content, and not just in China. But in China, the other reason is political.

Internet publishers, especially news portals and social media platforms, have faced heightened pressure to censor as the country has made an authoritarian and nationalistic turn under Xi’s leadership. Keeping China’s cyberspace politically and culturally pure is a top order of the Communist Party. Internet companies have more incentive to over-censor and let older content disappear by not archiving.

Many people have had their online existences erased.

Two weeks ago, Nanfu Wang found that an entry about her on a Wikipedia-like site was gone. Wang, a documentary filmmaker, searched her name on the film review site Douban and came up with nothing. Same with WeChat.

“Some of the films I directed had been deleted and banned on the Chinese Internet,” she said. “But this time, I feel that I, as a part of history, have been erased.” She doesn’t know what triggered it.

Zhang Ping, better known by his pen name, Chang Ping, was one of China’s most famous journalists in the 2000s. His articles were everywhere. Then in 2011, his writing provoked the wrath of the censors.

“My presence in public discourse has been stifled much more severely than I anticipated, and that represents a significant loss of my personal life,” he told me. “My life has been negated.”

When my Weibo account was deleted in March 2021, I was saddened and angered. It had more than three million followers and thousands of posts recording my life and thoughts over a decade. Many of the posts were about current affairs, history or politics, but some were personal musings. I felt a part of my life had been carved away.

Many people intentionally hide their online posts because they could be used against them by the party or its proxies. In a trend called “grave digging”, nationalistic “little pinks” pour over past online writings of intellectuals, entertainers and influencers.

For Chinese, our online memories, even frivolous ones, can become baggage we need to unload.

“Even though we tend to think of the Internet as somewhat superficial,” said Ian Johnson, a longtime China correspondent and author, “without many of these sites and things, we lose parts of our collective memory.”

In Sparks, a book by Johnson about brave historians in China who work underground, he cited the Internet Archive for Chinese online sources in the endnotes because, he said, he knew they would all eventually disappear.

“History matters in every country, but it really matters to the CCP,” he said, referring to the Chinese Communist Party. “It’s history that justifies the party’s continued rule.”

Johnson founded the China Unofficial Archives website, which seeks to preserve blogs, movies and documents outside the Chinese Internet.

There are other projects to save Chinese memories and history from falling into a void. Greatfire.org has several websites that provide access to censored content. China Digital Times, a nonprofit that fights censorship, archives work that has been or is in danger of being blocked. Zhang is its executive editor.

He, author of the WeChat post that went viral, is deeply pessimistic that China’s erasure of history can be reversed.

“If you can still see some early information on the Chinese Internet now,” he wrote, “it is just the last ray of the setting sun.”

Source: https://www.thestar.com.my/tech/tech-news/2...llective-memory
TShaya
post Jun 7 2024, 08:39 AM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

QUOTE(wailam @ Jun 7 2024, 08:33 AM)
server dead.
not pay web service.
etc etc.
*
I appreciate the "etc etc", but try and find anything about Abdullah Ahmad Badawi's tenure on the PMO website.

QUOTE(MegaCanonF @ Jun 7 2024, 08:35 AM)
inb4 cloud how?
*
When do you think websites are hosted?
xpole
post Jun 7 2024, 08:43 AM

Rain on me baby
******
Senior Member
1,410 posts

Joined: Dec 2009
From: Everywhere



Videos are mostly permanent to be honest

Your own sex video, your own jerk off video on webcam that you recorded in 2013 can still available.

I have ex friend from uni recorded his sex video, the video still available the last time I accidentally found this year.

This post has been edited by xpole: Jun 7 2024, 08:44 AM
kitzai
post Jun 7 2024, 08:47 AM

Getting Started
**
Junior Member
172 posts

Joined: Oct 2007
well .. nothing lasts forever
mcchin
post Jun 7 2024, 08:50 AM

SLAVA UKRAINI !
*******
Senior Member
3,902 posts

Joined: Jul 2005
From: Sin Lor, B'worth,Pg.
QUOTE(kitzai @ Jun 7 2024, 08:47 AM)
well .. nothing lasts forever
*
https://www.spacejam.com/1996/
TShaya
post Jun 7 2024, 08:52 AM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

QUOTE(xpole @ Jun 7 2024, 08:43 AM)
Videos are mostly permanent to be honest

Your own sex video, your own jerk off video on webcam that you recorded in 2013 can still available.

I have ex friend from uni recorded his sex video, the video still available the last time I accidentally found this year.
*
Yet a lot of the video's from 2014 about MH370 news coverage has disappeared.

The paradox of the internet.
delon85
post Jun 7 2024, 08:52 AM

Ad Astra
******
Senior Member
1,061 posts

Joined: Jan 2003



QUOTE(xpole @ Jun 7 2024, 08:43 AM)
Videos are mostly permanent to be honest

Your own sex video, your own jerk off video on webcam that you recorded in 2013 can still available.

I have ex friend from uni recorded his sex video, the video still available the last time I accidentally found this year.
*
How you accidentally found unless you're not looking for it?

And where's the link?
msacras
post Jun 7 2024, 09:01 AM

Regular
******
Senior Member
1,767 posts

Joined: Jan 2019
QUOTE(xpole @ Jun 7 2024, 08:43 AM)
Videos are mostly permanent to be honest

Your own sex video, your own jerk off video on webcam that you recorded in 2013 can still available.

I have ex friend from uni recorded his sex video, the video still available the last time I accidentally found this year.
*
Disgusting where.jpg

More importantly, the actress and video quality gooding or not?
iGamer
post Jun 7 2024, 09:08 AM

Toxic ktards probably losers irl
******
Senior Member
1,374 posts

Joined: Feb 2016
From: Milky Way
Make sense, who will keep the storage and server active forever without revenue to cover the cost?
MR_alien
post Jun 7 2024, 09:15 AM

Mr.Alien on the loss
*******
Senior Member
3,582 posts

Joined: Oct 2007
From: everywhere in sabah



internet is forever but web hosting and domain isn't...it's not free...somebody has to pay for it
iZuDeeN
post Jun 7 2024, 09:31 AM

Look at all my stars!!
*******
Senior Member
3,466 posts

Joined: Jan 2003
From: PJ, Malaysia
QUOTE(iGamer @ Jun 7 2024, 09:08 AM)
Make sense, who will keep the storage and server active forever without revenue to cover the cost?
*
true..I've been paying avg RM300 a year for domain & hosting to host my portfolio... that's RM3000/10 yrs without any revenue
ciwi1166
post Jun 7 2024, 09:33 AM

~~~Prestissimo~~~
*******
Senior Member
2,263 posts

Joined: Dec 2006
From: In your head...
QUOTE(delon85 @ Jun 7 2024, 08:52 AM)
How you accidentally found unless you're not looking for it?

And where's the link?
*
some psycho peoples downloaded and keep porn for years, and reupload them in future.
so yeah, once you put on the net can never be taken back.
TShaya
post Jun 7 2024, 10:59 AM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

Linkrot
The internet is a library built on quicksand.
May 22, 2024

user posted image

Here’s an underrated cognitive virtue: “object permanence” — that is, remembering how you perceived something previously. As Riley Quinn often reminds us, the left is the ideology of object permanence — to be a leftist is to hate and mistrust the CIA even when they’re tormenting Trump for a brief instant, or to remember that it was once possible for a working person to support their family with their wages:

https://pluralistic.net/2023/10/27/six-sell...olding-it-wrong

The thing is, object permanence is hard. Life comes at you quickly. It’s very hard to remember facts, and the order in which those facts arrived — it’s even harder to remember how you felt about those facts in the moment.

This is where blogging comes in — for me, at least. Back in 1997, Scott Edelman — editor of Science Fiction Age — asked me to take over the back page of the magazine by writing up ten links of interest for the nascent web. I wrote that column until the spring of 2000, then, in early 2001, Mark Frauenfelder asked me to guest-edit Boing Boing, whereupon the tempo of my web-logging went daily. I kept that up on Boing Boing for more than 19 years, writing about 54,000 posts. In February, 2020, I started Pluralistic.net, my solo project, a kind of blog/newsletter, and in the four-plus years since, I’ve written about 1,200 editions containing between one and twelve posts each.

This gigantic corpus of everything I ever considered to be noteworthy is immensely valuable to me. The act of taking notes in public is a powerful discipline: rather than jotting cryptic notes to myself in a commonplace book, I publish those notes for strangers. This imposes a rigor on the note-taking that makes those notes far more useful to me in years to come.

Better still: public note-taking is powerfully mnemonic. The things I’ve taken notes on form a kind of supersaturated solution of story ideas, essay ideas, speech ideas, and more, and periodically two or more of these fragments will glom together, nucleate, and a fully-formed work will crystallize out of the solution.

Then, the fact that all these fragments are also database entries — contained in the back-end of a WordPress installation that I can run complex queries on — comes into play, letting me swiftly and reliably confirm my memories of these long-gone phenomena. Inevitably, these queries turn up material that I’ve totally forgotten, and these make the result even richer, like adding homemade stock to a stew to bring out a rich and complicated flavor. Better still, many of these posts have been annotated by readers with supplemental materials or vigorous objections.

I call this all “The Memex Method” and it lets me write a lot (I wrote nine books during lockdown, as I used work to distract me from anxiety — something I stumbled into through a lifetime of chronic pain management):

https://pluralistic.net/2021/05/09/the-memex-method/

Back in 2013, I started a new daily Boing Boing feature: “This Day In Blogging History,” wherein I would look at the archive of posts for that day one, five and ten years previously:

https://boingboing.net/2013/06/24/this-day-...ng-history.html

With Pluralistic, I turned this into a daily newsletter feature, now stretching back to twenty, fifteen, ten, five and one year ago. Here’s today’s:

https://pluralistic.net/2024/05/21/noway-back-machine/#retro

This is a tremendous adjunct to the Memex Method. It’s a structured way to review everything I’ve ever thought about, in five-year increments, every single day. I liken this to working dough, where there’s stuff at the edges getting dried out and crumbly, and so your fold it all back into the middle. All these old fragments naturally slip out of your thoughts and understanding, but you can revive their centrality by briefly paying attention to them for a few minutes every day.

This structured daily review is a wonderful way to maintain object permanence, reviewing your attitudes and beliefs over time. It’s also a way to understand the long-forgotten origins of issues that are central to you today. Yesterday, I was reminded that I started thinking about automotive Right to Repair 15 years ago:

https://www.eff.org/deeplinks/2009/05/right-repair-law-pro

Given that we’re still fighting over this, that’s some important perspective, a reminder of the likely timescales involved in more recent issues where I feel like little progress is being made.

Remember when we all got pissed off because the mustache-twirling evil CEO of Warners, David Zaslav, was shredding highly anticipated TV shows and movies prior to their release to get a tax-credit? Turns out that we started getting angry about this stuff twenty years ago, when Michael Eisner did it to Michael Moore’s “Fahrenheit 911”:

https://www.nytimes.com/2004/05/05/us/disne...cizes-bush.html

It’s not just object permanence: this daily spelunk through my old records is also a way to continuously and methodically sound the web for linkrot: when old links go bad. Over the past five years, I’ve noticed a very sharp increase in linkrot, and even worse, in the odious practice of spammers taking over my dead friends’ former blogs and turning them into AI spam-farms:

https://www.wired.com/story/confessions-of-...ckbait-kingpin/

The good people at the Pew Research Center have just released a careful, quantitative study of linkrot that confirms — and exceeds — my worst suspicions about the decay of the web:

https://www.pewresearch.org/data-labs/2024/...ent-disappears/

The headline finding from “When Online Content Disappears” is that 38% of the web of 2013 is gone today. Wikipedia references are especially hard-hit, with 23% of news links missing and 21% of government websites gone. The majority of Wikipedia entries have at least one broken link in their reference sections. Twitter is another industrial-scale oubliette: a fifth of English tweets disappear within a matter of months; for Turkish and Arabic tweets, it’s 40%.

Thankfully, someone has plugged the web’s memory-hole. Since 2001, the Internet Archive’s Wayback Machine has allowed web users to see captures of web-pages, tracking their changes over time. I was at the Wayback Machine’s launch party, and right away, I could see its value. Today, I make extensive use of Wayback Machine captures for my “This Day In History” posts, and when I find dead links on the web.

The Wayback Machine went public in 2001, but Archive founder Brewster Kahle started scraping the web in 1996. Today’s post graphic — a modified Yahoo homepage from October 17, 1996 — is the oldest Yahoo capture on the Wayback Machine:

https://web.archive.org/web/19960501000000*/yahoo.com

Remember that the next time someone tells you that we must stamp out web-scraping for one reason or another. There are plenty of ugly ways to use scraping (looking at you, Clearview AI) that we should ban, but scraping itself is very good:

https://pluralistic.net/2023/09/17/how-to-t...about-scraping/

And so is the Internet Archive, which makes the legal threats it faces today all the more frightening. Lawsuits brought by the Big Five publishers and Big Three labels will, if successful, snuff out the Internet Archive altogether, and with it, the Wayback Machine — the only record we have of our ephemeral internet:

https://blog.archive.org/2024/04/19/interne...rchive-lawsuit/

Libraries burn. The Internet Archive may seem like a sturdy and eternal repository for our collective object permanence about the internet, but it is very fragile, and could disappear like that.

Source: https://doctorow.medium.com/https-pluralist...ew-a5ee2b44c2d4
K.I.T.T
post Jun 7 2024, 11:11 AM

On my way
****
Junior Member
672 posts

Joined: Sep 2008
QUOTE(xpole @ Jun 7 2024, 08:43 AM)
Videos are mostly permanent to be honest

Your own sex video, your own jerk off video on webcam that you recorded in 2013 can still available.

I have ex friend from uni recorded his sex video, the video still available the last time I accidentally found this year.
*
mesti rasa time machine kan
Quantum Geist
post Jun 7 2024, 11:15 AM

Getting Started
**
Junior Member
109 posts

Joined: May 2013


more occurrence of

"hey I found the solution to my ultra niche issue here: <dead link>"

soon
jonthebaptist
post Jun 7 2024, 11:25 AM

Regular
******
Senior Member
1,036 posts

Joined: Sep 2022
Sure die when google doesn't send you the traffic, if you notice, whenever you google search anything these days, the results are often something recent. Won't show you 10 year old things for example
jonthebaptist
post Jun 7 2024, 11:28 AM

Regular
******
Senior Member
1,036 posts

Joined: Sep 2022
QUOTE(xpole @ Jun 7 2024, 08:43 AM)
Videos are mostly permanent to be honest

Your own sex video, your own jerk off video on webcam that you recorded in 2013 can still available.

I have ex friend from uni recorded his sex video, the video still available the last time I accidentally found this year.
*
Even if available it's probably not in its original HD source quality. Meaning you're watching some low quality version of the original that has been downloaded, re-uploaded and compressed till kingdom come.
GOPI56
post Jun 7 2024, 11:31 AM

Regular
******
Senior Member
1,495 posts

Joined: Dec 2012
QUOTE(MegaCanonF @ Jun 7 2024, 08:35 AM)
inb4 cloud how?
*
Cloud also same, if you have hosted web-site on a cloud server and no pay for monthly or yearly subscription fees. The website will be gone.
Izzet
post Jun 7 2024, 11:43 AM

Getting Started
**
Junior Member
157 posts

Joined: Oct 2008
QUOTE(xpole @ Jun 7 2024, 08:43 AM)
Videos are mostly permanent to be honest

Your own sex video, your own jerk off video on webcam that you recorded in 2013 can still available.

I have ex friend from uni recorded his sex video, the video still available the last time I accidentally found this year.
*
Hmm... PM link for research and education purpose pls

TShaya
post Jul 1 2024, 12:19 PM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

user posted image
party
post Jul 1 2024, 12:29 PM

Enthusiast
*****
Senior Member
816 posts

Joined: May 2013


I would prefer not to challenges net citizens. Seems deleted but once u went retard citizens will upped u
TShaya
post Jul 2 2024, 01:58 PM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

QUOTE(haya @ Jul 1 2024, 12:19 PM)
» Click to show Spoiler - click again to hide... «

*
user posted image
Source: https://variety.com/2024/music/opinion/disa...1236052577/amp/
TShaya
post Aug 2 2024, 06:17 PM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

user posted image
Source: https://www.niemanlab.org/2024/07/to-preser...heir-own-hands/
TShaya
post Aug 15 2024, 01:18 PM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

user posted image
Source:https://slate.com/technology/2024/08/david-zaslav-warner-bros-discovery-culture-deleting-movies-tv-shows.html
Hobbez
post Aug 15 2024, 01:28 PM

Regular
******
Senior Member
1,235 posts

Joined: Dec 2009
QUOTE(haya @ Jun 7 2024, 08:36 AM)
As China’s Internet disappears, ‘we lose parts of our collective memory’
By Li Yuan
    Wednesday, 05 Jun 2024    10:00 AM MYT

Chinese people know their country’s Internet is different. There is no Google, YouTube, Facebook or Twitter. They use euphemisms online to communicate the things they are not supposed to mention. When their posts and accounts are censored, they accept it with resignation.

They live in a parallel online universe. They know it and even joke about it.

Now they are discovering that, beneath a facade bustling with short videos, livestreaming and ecommerce, their Internet – and collective online memory – is disappearing in chunks.

A post on WeChat on May 22 that was widely shared reported that nearly all information posted on Chinese news portals, blogs, forums and social media sites between 1995 and 2005 was no longer available.

“The Chinese Internet is collapsing at an accelerating pace,” the headline said. Predictably, the post itself was soon censored.

“We used to believe that the Internet had a memory,” He Jiayan, a blogger who writes about successful businesspeople, wrote in the post. “But we didn’t realise that this memory is like that of a goldfish.”

It’s impossible to determine exactly how much and what content has disappeared. But I did a test. I used China’s top search engine, Baidu, to look up some of the examples cited in He’s post, focusing on about the same time frame between the mid-1990s and mid-2000s.

I started with Alibaba’s Jack Ma and Tencent’s Pony Ma, two of China’s most successful Internet entrepreneurs, both of whom He had searched for. I also searched for Liu Chuanzhi, known as the godfather of Chinese entrepreneurs, who made headlines when his company, Lenovo, acquired IBM’s personal computer business in 2005.

I looked, too, for results for China’s top leader, Xi Jinping, who during the period was the governor of two big provinces. Search results of senior Chinese leaders are always closely controlled. I wanted to see what people could find if they were curious about what Xi was like before he became a national leader.

I got no results when I searched for Ma Yun, which is Jack Ma’s name in Chinese. I found three entries for Ma Huateng, which is Pony Ma’s name. A search for Liu Chuanzhi turned up seven entries.

There were zero results for Xi.

Then I searched for one of the most consequential tragedies in China in the past few decades: the Great Sichuan earthquake on May 12, 2008, which killed over 69,000 people. It happened during a brief period when Chinese journalists had more freedom than the Communist Party would usually allow, and they produced a lot of high-quality journalism.

When I narrowed the time frame to May 12, 2008, to May 12, 2009, Baidu came up with nine pages of search results, most of which consisted of articles on the websites of the central government or the state broadcaster Central Central Television. One caveat: If you know the names of the journalists and their organizations, you can find more.

Each results page had about 10 headlines. My search found what had to have been a small fraction of the coverage at that time, much of which was published on the sites of newspapers and magazines that sent journalists to the epicenter of the earthquake. I didn’t find any of the outstanding news coverage or outpouring of online grief that I remembered.

In addition to disappearing content, there’s a broader problem: China’s Internet is shrinking. There were 3.9 million websites in China in 2023, down more than one-third from 5.3 million in 2017, according to the country’s Internet regulator.

China has one billion Internet users, or nearly one-fifth of the world’s online population. Yet the number of websites using Chinese language make up only 1.3% of the global total, down from 4.3% in 2013 – a 70% plunge over a decade, according to Web Technology Surveys, which tracks online use of top content languages.

The number of Chinese language websites is now only slightly higher than those in Indonesian and Vietnamese, and smaller than those in Polish and Persian. It’s half the number of Italian language sites and just over a quarter of those in Japanese.

One reason for the decline is that it is technically difficult and costly for websites to archive older content, and not just in China. But in China, the other reason is political.

Internet publishers, especially news portals and social media platforms, have faced heightened pressure to censor as the country has made an authoritarian and nationalistic turn under Xi’s leadership. Keeping China’s cyberspace politically and culturally pure is a top order of the Communist Party. Internet companies have more incentive to over-censor and let older content disappear by not archiving.

Many people have had their online existences erased.

Two weeks ago, Nanfu Wang found that an entry about her on a Wikipedia-like site was gone. Wang, a documentary filmmaker, searched her name on the film review site Douban and came up with nothing. Same with WeChat.

“Some of the films I directed had been deleted and banned on the Chinese Internet,” she said. “But this time, I feel that I, as a part of history, have been erased.” She doesn’t know what triggered it.

Zhang Ping, better known by his pen name, Chang Ping, was one of China’s most famous journalists in the 2000s. His articles were everywhere. Then in 2011, his writing provoked the wrath of the censors.

“My presence in public discourse has been stifled much more severely than I anticipated, and that represents a significant loss of my personal life,” he told me. “My life has been negated.”

When my Weibo account was deleted in March 2021, I was saddened and angered. It had more than three million followers and thousands of posts recording my life and thoughts over a decade. Many of the posts were about current affairs, history or politics, but some were personal musings. I felt a part of my life had been carved away.

Many people intentionally hide their online posts because they could be used against them by the party or its proxies. In a trend called “grave digging”, nationalistic “little pinks” pour over past online writings of intellectuals, entertainers and influencers.

For Chinese, our online memories, even frivolous ones, can become baggage we need to unload.

“Even though we tend to think of the Internet as somewhat superficial,” said Ian Johnson, a longtime China correspondent and author, “without many of these sites and things, we lose parts of our collective memory.”

In Sparks, a book by Johnson about brave historians in China who work underground, he cited the Internet Archive for Chinese online sources in the endnotes because, he said, he knew they would all eventually disappear.

“History matters in every country, but it really matters to the CCP,” he said, referring to the Chinese Communist Party. “It’s history that justifies the party’s continued rule.”

Johnson founded the China Unofficial Archives website, which seeks to preserve blogs, movies and documents outside the Chinese Internet.

There are other projects to save Chinese memories and history from falling into a void. Greatfire.org has several websites that provide access to censored content. China Digital Times, a nonprofit that fights censorship, archives work that has been or is in danger of being blocked. Zhang is its executive editor.

He, author of the WeChat post that went viral, is deeply pessimistic that China’s erasure of history can be reversed.

“If you can still see some early information on the Chinese Internet now,” he wrote, “it is just the last ray of the setting sun.”

Source: https://www.thestar.com.my/tech/tech-news/2...llective-memory
*
Cultural Revolution 2.0
alucard89
post Aug 15 2024, 02:10 PM

Look at all my stars!!
*******
Senior Member
4,454 posts

Joined: Apr 2006
I remember someone posted some archived articles from sliders, sci fi show from late 90s

Lots of interesting facts there you can’t google anymore
Mattrock
post Aug 15 2024, 02:13 PM

Casual
***
Junior Member
395 posts

Joined: Dec 2017
Yes, please delete everything, including stored files, after 10 years. Save the planet. We can't afford to have more data centers sucking up energy usage. Whoever need to keep can store in their own hard discs.
Similanjau
post Aug 15 2024, 02:32 PM

New Member
*
Junior Member
13 posts

Joined: Jan 2020
my 1st fapping site uh-oh.com still running strong after 2 decades.

kekw
brkli
post Aug 15 2024, 02:33 PM

On my way
****
Junior Member
592 posts

Joined: Oct 2018
even p2p content also can died off, apatah lagi client server...
alanyuppie
post Aug 15 2024, 02:41 PM

Look at all my stars!!
*******
Senior Member
2,834 posts

Joined: Jul 2006
From: here


aside from geocities, used to have webpage in xoom.

now domain owned by a bank liao.

alanyuppie
post Aug 15 2024, 02:41 PM

Look at all my stars!!
*******
Senior Member
2,834 posts

Joined: Jul 2006
From: here


QUOTE(Similanjau @ Aug 15 2024, 03:32 PM)
my 1st fapping site uh-oh.com still running strong after 2 decades.

kekw
*
uh huh.
spursfan
post Aug 15 2024, 03:17 PM

Getting Started
**
Junior Member
110 posts

Joined: Jan 2009
QUOTE(Hobbez @ Aug 15 2024, 01:28 PM)
Cultural Revolution 2.0
*
great leap backward
ieatchickens
post Aug 15 2024, 07:27 PM

Getting Started
**
Junior Member
111 posts

Joined: Apr 2011
From: kuala lumpur


silly goy, we own your memories past and future

user posted image
TShaya
post Sep 19 2024, 03:12 PM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

We're losing our digital history. Can the Internet Archive save it?
Research shows 25% of web pages posted between 2013 and 2023 have vanished. A few organisations are racing to save the echoes of the web, but new risks threaten their very existence.
By Chris Stokel-Walker

It's possible, thanks to surviving fragments of papyrus, mosaics and wax tablets, to learn what Pompeiians ate for breakfast 2,000 years ago. Understand enough Medieval Latin, and you can learn how many livestock were reared at farms in Northumberland in 11th Century England – thanks to the Domesday Book, the oldest document held in the UK National Archives. Through letters and novels, the social lives of the Victorian era – and who they loved and hated – come into view.

But historians of the future may struggle to understand fully how we lived our lives in the early 21st Century. That's because of a potentially history-deleting combination of how we live our lives digitally – and a paucity of official efforts to archive the world's information as it's produced these days.

However, an informal group of organisations are pushing back against the forces of digital entropy – many of them operated by volunteers with little institutional support. None is more synonymous with the fight to save the web than the Internet Archive, an American non-profit based in San Francisco, started in 1996 as a passion project by internet pioneer Brewster Kahl. The organisation has embarked what may be the most ambitious digital archiving project of all time, gathering 866 billion web pages, 44 million books, 10.6 million videos of films and television programmes and more. Housed in a handful of data centres scattered across the world, the collections of the Internet Archive and a few similar groups are the only things standing in the way of digital oblivion.

"The risks are manifold. Not just that technology may fail, but that certainly happens. But more important, that institutions fail, or companies go out of business. News organisations are gobbled up by other news organisations, or more and more frequently, they're shut down," says Mark Graham, director of the Internet Archive's Wayback Machine, a tool that collects and stores snapshots of websites for posterity. There are numerous incentives to put content online, he says, but there's little pushing companies to maintain it over the long term.

Despite the Internet Archive's achievements thus far, the organisation and others like it face financial threats, technical challenges, cyberattacks and legal battles from businesses who dislike the idea of freely available copies of their intellectual property. And as recent court losses show, the project of saving the internet could be just as fleeting as the content it's trying to protect.

"More and more of our intellectual endeavours, more of our entertainment, more of our news, and more of our conversations exist only in a digital environment," Graham says. "That environment is inherently fragile."

Saving our history

A quarter of all web pages that existed at some point between 2013 and 2023 now… don't. That's according to a recent study by Pew Research Center, a think tank based in Washington, DC, which raised the alarm of our disappearing digital history. Researchers found the problem is more acute the older a web page is: 38% of web pages that Pew tried to access that existed in 2013 no longer function. But it's also an issue for more recent publications. Some 8% of web pages published at some point 2023 were gone by October that same year.

This isn't just a concern for history buffs and internet obsessives. According to the study, one in five government websites contains at least one broken link. Pew found more than half of Wikipedia articles have a broken link in their references section, meaning the evidence backing up the online encyclopaedia's information is slowly disintegrating.

But thanks to the work of the Internet Archive, not all those dead links are totally inaccessible. For decades, the Archive's Wayback Machine project has sent armies of robots to crawl through the cascading labyrinths of the internet. These systems download functional copies of websites as they change over time – often capturing the same pages multiple times in a single day – and make them available to public free of charge.

"When we then went and looked at how many of those URLs were available in the Wayback Machine, we found that two-thirds of those were available in a way," he says. In that sense, the Internet Archive is doing what it set out to do – it's saving records of online society for posterity.

A few other organisations, big and small, work on similar projects. The US Library of Congress, for example, preserves government websites, the sites of congressmembers and a collection of US news sites. The Library of Congress also preserved a copy of every single tweet sent since the founding of Twitter (now known as X), until the project was shut down in 2017. Other governments run their own initiatives. The UK Web Archive conducts an annual crawl of websites with .UK domain names, capturing a snapshot of the British internet at least once a year. In 2022, band of volunteers to set out to save the Ukrainian internet as it was hit by Russian cyberattacks.

But the scope of these projects is narrow, while the Internet Archive aims for a comprehensive approach. Given the available resources, it would be impossible to collect anything close to the whole internet, but its systems cast a broad net. Depending on what you're looking for, the Internet Archive's collection is so thorough it can sometimes feel like a functionally complete record of the web.

Success breeds complacency

The Archive's publicly accessible documents help sustain records of our lives in the current era. It's become a standard practise on Wikipedia to cite copies of websites from the Internet Archive’s Wayback Machine, rather than the original websites themselves. The organisation also preserves a vast collection of media that predates the digital era. The beloved 1977 comedy series Fernwood 2 Night isn't available on any streaming service, but you can watch it free on the Internet Archive. Books, magazines and websites cite the Internet Archive’s scanned digital copies of books that are unavailable in physical libraries. It even acts as a preservation tool for the public; anyone can upload videos, websites and practically anything else to organisation's servers.

Among the major collections that the Wayback Machine has salvaged from the digital scrapheap are deep records of websites built on GeoCities, a now defunct personal web hosting service. Long before social media, GeoCities was among the first platforms that made it easy for anyone to create their own website. Historians view GeoCities as one of the most important chapters in the early days of the world wide web, without the efforts of the Internet Archive, most of its websites would be lost. In more recent history, a US Congressional Committee relied on the Internet Archive to preserve article and documents related to the January 6 insurrection.

"Every few years there's a new platform come along and then the economic forces suddenly kind of collapse in it," says Andrew Jackson, preservation registry technical architect at the Digital Preservation Coalition, a UK-based advocacy group and charity that advises on how to preserve the world's online digital archives. "That's one big source of churn."

The tech news website CNET faced backlash in 2023 after reports that the company had deleted tens of thousands of articles, amounting to decades of lost history. Among CNET's responses was a promise that all its deleted articles had been preserved in the Wayback Machine. Many critics argued the company was taking the Internet Archive for granted, passing on its own archival responsibilities.

"Even though Google and other search engines actively incentivise you to maintain stable URLs, it's just technically quite difficult to do that," says Jackson. "Every time a new company kind of revamped its website, it has to work out how much of its new URLs it's going to try and maintain through time."

But it's worth remembering what the Internet Archive is: a non-profit organisation, financed by donations from charitable foundations. It makes for a never-ending project with exponentially growing costs. The Internet Archive volunteered to take on the mantle of being the world's leading library for our digital lives. As the web approaches its fourth decade, this entirely unofficial project has become a foundational pillar of the internet.

But as our reliance on the Internet Archive grows, so too do the threats pecking away at its efforts.

Single points of failure

Last week, the organisation announced a major partnership with Google, where the tech giant engine will include links to the Wayback Machine in search results – though neither released financial details about the deal.

But other recent news demonstrates that the project is still fragile. That vulnerability was laid bare in a court case against the Internet Archive by four large book publishers, who alleged that the Internet Archive’s practise of scanning physical books and lending out digital copies breaches US copyright law. Before the pandemic, the Internet Archive would only lend one digital copy at a time for each physical book in its collection. But during the Covid shutdowns, the organisation lifted that restriction, letting patrons borrow unlimited digital copies of books to try and make up for the closure of physical libraries.

A US court ruled that practice was illegal in 2023, and in early September, the Internet Archive’s appeal against that decision was rejected. The organisation previously said that it agreed to pay the a publishing industry trade group an undisclosed sum in relation to the case.

With that lawsuit in the rearview, the Internet Archive is fighting yet another court case against music labels for digitising records that could cost it $400m (about £305m) if it loses. It's an amount that could jeopardise the non-profit's survival.

Internet Archive's director of library services Chris Freeland said the organisation is reviewing the courts' opinion a statement about the ruling.

Existential legal battles aren't the only hazards menacing the world of digital preservation. The British Library's UK Web Archive got a taste of some malevolent technical challenges last when a cyberattack took its digital systems offline in October 2023. Almost a year later, the UK Web Archive is still dealing with the fallout. Online access to much of its collection is still unavailable.

In May 2024, the Internet Archive announced it was in the midst of a large distributed denial of service (DDoS) attack. In a DDoS attack, vandals or other bad actors set up automated systems to bombard websites with visits, attempting to push them offline by overwhelming their servers. At its peak, tens of thousands of concurrent visits were happening every second. Services, including the Wayback Machine, went down. It meant that the regular drumbeat of archiving was disrupted for a time, and there may be permanent gaps in the historical record as a result.

The Internet Archive "was started by one individual, and it has become a kind of linchpin", says Jackson. "It also feels like this potential single point of failure. Although it's a lot more sophisticated than just volunteers, it is one institution in one region, under one legal framework."

The organisation shares these concerns. If the Internet Archive's work stopped and "that void wasn't immediately filled, then much of what is currently made available on the public web would be at risk", says Graham.

He's clear that the Internet Archive won't step back from its responsibilities anytime soon, but the project can use outside help. "There are opportunities for many others to contribute in a variety of ways," he says.

Shared responsibilities, split priorities

With no formal effort to organise efforts to preserve the internet, the project is left to hobbyists, volunteers, and a few groups of unofficial bodies that generally operate independently.

"It makes sense that the archival response is decentralised," says Mar Hicks, a historian of technology at the University of Virginia. "But one of the problems is the varied priorities."

Hicks points out that one of the first things any archivist will consider when building an archive is what to prioritise. "And when it's so decentralised, the priorities are going to be very different," Hicks says. "There's going to be people in groups who prioritise trying to grab everything – as much as they possibly can, they might be very completionist." Then there will be others who are focused only on certain areas – for instance, the UK archiving effort.

The concern about such an ad hoc, decentralised approach is that it's possible there's overlap, meaning precious archiving resources are wasted getting duplicate or triplicate copies of the most popular websites – all while some areas that may have historical importance are overlooked because they fall between different groups' responsibilities.

"Archivists will tell you that these issues have existed for a very long time," Hicks says. But they're exacerbated by the level of stuff being produced in our digital world. Nearly a billion emails are sent every day. YouTube reports that more than 500 hours' worth of video content is posted on the platform every minute.

The internet is "essentially a firehose of information and material," says Hicks. "It doesn't make sense to try to catch everything that comes out of the firehose. That wouldn't make sense from a resource standpoint."

In one sense this is an old concern. "We have, as historians, those same problems," says Hicks. "We have a wealth of documents from the past. But we only have certain documents and certain people's voices, and a lot of those voices that were missing were incredibly important, and they've been erased."

For Hicks, there needs to be some sort of priority about what is being saved from the digital footprints of our generation. Otherwise we run the risk that rapidly ballooning costs will sideline efforts to save the history of the web – not to mention the oceans of digital files that live offline.

"If you have to keep everything, it becomes very expensive," says Jackson of the Digital Preservation Coalition. "There's often older content or less compelling content [that] gets lost by the wayside," he says.

"We're not capturing the non-Western world well," admits Jackson. "There are gaps now around incompleteness in different cultural domains."

And while many of those organisations work to fight against their biases and prejudices, they're often left to carry the weight of the task while governments and the companies that run the platforms and websites sit by. "Independent groups of people, who are just caring about it and are willing to spend their free time doing it, are better resourced and more highly skilled than the institutions which are formally responsible," says Jackson.

There's a vacuum, argues Hicks, which few people other than a handful of archivist obsessives are filling. "It's not clear whose responsibility it is to archive [the internet] or whose interest it would serve," Hicks says.

One thing is clear, though, Hicks says, we should all pay up to support the fight for preservation. "From a very pragmatic perspective, if you do not pay these people and make sure that these archives are funded, they will not exist into the future, they will break down and then the whole point of collecting them will have gone out the window," says Hicks. "Because the whole point of the archive is not that it just gets collected, but that it persists indefinitely into the future."

The Enlightenment of the 18th century saw the birth of an international library movement as governments and philanthropists took on the need to preserve and distribute books for the public. But that sense of civic responsibility hasn't extended to the internet. That may be due to the complicated business interests of the digital world, or just the immense technical challenge. Or, perhaps, it's because it doesn't feel like the web needs saving to casual observers. A book is a more obviously finite resource; it can be lost or damaged. But the internet feels so accessible. Anyone with an internet connection can pull up a web browser and dial in a URL. It's all right there – until it isn't.

Source: https://www.bbc.com/future/article/20240912...ve-the-internet
blmse92
post Sep 19 2024, 03:17 PM

On my way
****
Junior Member
603 posts

Joined: Dec 2011
From: shah alam



msn messenger


*nudge*


owai
lagenda110
post Sep 19 2024, 03:27 PM

Getting Started
**
Junior Member
148 posts

Joined: Jul 2022
i cant remember the pron title that i watched from vcd back in 2005 dang
TShaya
post Sep 19 2024, 03:31 PM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

QUOTE(lagenda110 @ Sep 19 2024, 03:27 PM)
i cant remember the pron title that i watched from vcd back in 2005 dang
*
Do you still have the VCD?
lagenda110
post Sep 19 2024, 04:04 PM

Getting Started
**
Junior Member
148 posts

Joined: Jul 2022
QUOTE(haya @ Sep 19 2024, 03:31 PM)
Do you still have the VCD?
*
no
Dweller
post Sep 19 2024, 04:09 PM

Getting Started
**
Junior Member
105 posts

Joined: Jul 2013
Forum and Reddit will become the sources of useful information. I have been using reddit and lowyat keyword in my search term whenever I want to find information and very rarely it disappoints.
Zot
post Sep 19 2024, 04:13 PM

Look at all my stars!!
*******
Senior Member
7,938 posts

Joined: Mar 2014
Internet last forever but persons operating it don't.
azbro
post Sep 19 2024, 04:23 PM

Look at all my stars!!
*******
Senior Member
4,403 posts

Joined: Jan 2007
From: Johor Bahru


No wonder many of my childhood local prawn dishes disappear already.
TShaya
post Sep 20 2024, 10:02 AM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

QUOTE(lagenda110 @ Sep 19 2024, 04:04 PM)
no
*
I wonder is anyone archiving/preserving porn? 🤔
TShaya
post Nov 4 2024, 01:13 PM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

What would be lost if the Internet Archive were no more?
October 29, 20245:19 PM ET
Heard on All Things Considered
By Vincent Acovino, John Ketchum, Mary Louise Kelly, Zazil Davis-Vazquez

NPR's Mary Louise Kelly talks with Brewster Khale, the founder of Internet Archive, about the attack by hackers that put the archive offline for days — and what may have happened if it had succeeded.

MARY LOUISE KELLY, HOST:

Who preserves what happens on the internet? Well, one answer can be found in an old church in San Francisco, which is home to the Internet Archive. That's a nonprofit with a bigger digital collection than the Library of Congress. The Archive holds a record of billions of web pages on its servers. This month, it was attacked by hackers. It was offline for days, prompting a question - what would get lost if the Internet Archive is no more? Brewster Kahle is the founder of Internet Archive, and he's on the line now from San Francisco. Welcome.

BREWSTER KAHLE: Thank you very much, Mary Louise.

KELLY: So I just checked. Your site looks like it's up and running. It's online. Is everything back and fully restored?

KAHLE: It's not fully restored, but the basic services of archive.org and the Wayback Machine - yes, our library is back, which is really wonderful.

KELLY: The Wayback Machine - just to inject - this is the archival service for your website.

KAHLE: Yes, yes. That's where we collect web pages by working with 1,300 libraries, and then we make a service - a free service to make it so you could see the web as it was.

KELLY: So can you just describe what these last couple of weeks have looked like for you as you've tried to get the website back online and keep it up and running?

KAHLE: Oh, it's been so hard. Basically, we needed to upgrade our security significantly because, basically, somebody was able to deface our website. There's no damage to the data, which is great, but they did get our usernames and email addresses of patrons of the Internet Archive. So people's passwords continue to be safe because they were all encrypted, but the list of email addresses doesn't help the spam problem out there.

KELLY: Yeah. So why would hackers target you?

KAHLE: We don't know, but we're not the only ones. So, yes, we were targeted with cyberattacks, but also Calgary Public Library got hit, and they're still offline. Seattle Public Library system was attacked, and they're mostly back up - Toronto Public Library. And the big one was British Library was attacked a year ago, and still some of their services aren't back up.

KELLY: There's no ransom that's been demanded, nothing like that?

KAHLE: No. No, there hasn't been that. It's mostly been embarrassing. So we have gotten the message, and we're addressing - putting in more and better firewalls and code sanitization to go and make sure that the services stay up as people are expecting.

KELLY: Yeah. There's obviously a rich irony here for a site dedicated to preserving activity on the internet potentially being wiped off the internet. What would be lost if a cyberattack had actually succeeded at damaging your site permanently?

KAHLE: Well, if the Internet Archive were to go away - it has unique collections of the World Wide Web and also books, music, video, television for decades of television. It's a full-fledged library. So the idea of losing the Internet Archive is not just the web collection, but it would be a whole lot more.

KELLY: What you're describing, Mr. Kahle, is such a huge responsibility, protecting all of this information. Are you confident you can do it?

KAHLE: We'll do everything we can. As part of the job, we have preservation and access sort of tattooed into our souls. So preservation to go and make sure that the information that we've collected stays, and then we - can we make it as accessible as possible? We first wanted to make sure that our preservation function was in good shape, which we did. And then we worked very hard to try to get the access back up. But we're seeing attacks on libraries from many different angles, and this is just another one.

KELLY: That is Brewster Kahle, digital librarian and founder of the Internet Archive. Thank you.

KAHLE: Thank you.

Source: https://www.npr.org/2024/10/29/nx-s1-516245...ve-were-no-more
TShaya
post Jul 25 2025, 01:57 PM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

Why are data nerds racing to save US government statistics?
By MIKE SCHNEIDER
Internet
Friday, 25 Jul 2025 9:00 AM MYT

The data nerds are fighting back.

After watching data sets be altered or disappear from US government websites in unprecedented ways after President Donald Trump began his second term, an army of outside statisticians, demographers and computer scientists have joined forces to capture, preserve and share data sets, sometimes clandestinely.

Their goal is to make sure they are available in the future, believing that democracy suffers when policymakers don’t have reliable data and that national statistics should be above partisan politics.

"There are such smart, passionate people who care deeply about not only the Census Bureau, but all the statistical agencies, and ensuring the integrity of the statistical system. And that gives me hope, even during these challenging times,” Mary Jo Mitchell, director of government and public affairs for the research nonprofit the Population Association of America, said this week during an online public data-users conference.

The threats to the US data infrastructure since January have come not only from the disappearance or modification of data related to gender, sexual orientation, health, climate change and diversity, among other topics, but also from job cuts of workers and contractors who had been guardians of restricted-access data at statistical agencies, the data experts said.

"There are trillions of bytes of data files, and I can't even imagine how many public dollars were spent to collect those data.... But right now, they're sitting someplace that is inaccessible because there are no staff to appropriately manage those data,” Jennifer Park, a study director for the Committee on National Statistics, National Academies of Sciences, Engineering, and Medicine, said during the conference hosted by the Association of Public Data Users (APDU).

In February, the Center for Disease Control and Prevention’s official public portal for health data, data.cdc.gov, was taken down entirely but subsequently went back up. Around the same time, when a query was made to access certain public data from the US Census Bureau’s most comprehensive survey of American life, users for several days got a response that said the area was "unavailable due to maintenance” before access was restored.

Researchers Janet Freilich and Aaron Kesselheim examined 232 federal public health data sets that had been modified in the first quarter of this year and found that almost half had been "substantially altered," with the majority having the word "gender” switched to "sex,” they wrote this month in The Lancet medical journal.

One of the most difficult tasks has been figuring out what's been changed since many of the alterations weren't recorded in documentation.

Beth Jarosz, senior program director at the Population Reference Bureau, thought she was in good shape since she had previously downloaded data she needed from the National Survey of Children's Health for a February conference where she was speaking, even though the data had become unavailable. But then she realised she had failed to download the questionnaire and later discovered that a question about discrimination based on gender or sexual identity had been removed.

"It's the one thing my team didn't have,” Jarosz said at this week's APDU conference. "And they edited the questionnaire document, which should have been a historical record.”

Among the groups that have formed this year to collect and preserve the federal data are the Federation of American Scientists' dataindex.com, which monitors changes to federal data sets; the University of Chicago Library’s Data Mirror website, which backs up and hosts at-risk data sets; the Data Rescue Project, which serves as a clearinghouse for data rescue-related efforts; and the Federal Data Forum, which shares information about what federal statistics have gone missing or been modified – a job also being done by the American Statistical Association.

The outside data warriors also are quietly reaching out to workers at statistical agencies and urging them to back up any data that is restricted from the public.

"You can't trust that this data is going to be here tomorrow,” said Lena Bohman, a founding member of the Data Rescue Project.

Separately, a group of outside experts has unofficially revived a long-running US Census Bureau advisory committee that was killed by the Trump administration in March.

Census Bureau officials won't be attending the Census Scientific Advisory Committee meeting in September, since the Commerce Department, which oversees the agency, eliminated it. But the advisory committee will forward its recommendations to the bureau, and demographer Allison Plyer said she has heard that some agency officials are excited by the committee's re-emergence, even if it's outside official channels.

"We will send them recommendations but we don't expect them to respond since that would be frowned upon,” said Plyer, chief demographer at The Data Center in New Orleans. "They just aren't getting any outside expertise... and they want expertise, which is understandable from nerds.” – AP

Source: https://www.thestar.com.my/tech/tech-news/2...ment-statistics
pandah
post Jul 25 2025, 02:14 PM

Enthusiast
*****
Senior Member
719 posts

Joined: Jul 2011

storage mahal
Knnbuccb
post Jul 25 2025, 02:17 PM

Getting Started
**
Junior Member
118 posts

Joined: Dec 2021
my bizhosting page which i created back in year 2000 in pri school... is no longer there
ixaRA
post Jul 25 2025, 02:36 PM

Getting Started
**
Junior Member
91 posts

Joined: Sep 2013



Last time so many forum.. now hard to find it anymore
TShaya
post Jul 25 2025, 02:42 PM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003

QUOTE(pandah @ Jul 25 2025, 02:14 PM)
storage mahal
*
But /k say why need so many Data Centres wor
jerm57
post Jul 25 2025, 02:54 PM

Casual
***
Junior Member
405 posts

Joined: Nov 2010
From: Penis ular Bolehland


The internut now also more sanitised. I remember the days when it wasn't very difficult to find hardcore content if you knew the keywords.
SUSpetpenyubobo
post Jul 25 2025, 07:11 PM

Regular
******
Senior Member
1,030 posts

Joined: Jan 2022

QUOTE(haya @ Jun 7 2024, 08:09 AM)
Nearly 40% of web pages from 10 years ago are no longer accessible
Maybe the internet doesn't last forever?
By Christianna Silva  on May 22, 2024

*
Epstein will be so happy he has been forgotten.

Why some social media sites don't allow account deletion or closure request by the owner?

In Europe, GDPR states it's a personal individual right to erase his data over the internet.

Everything you need to know about the “Right to be forgotten”
https://gdpr.eu/right-to-be-forgotten/
SUSpetpenyubobo
post Jul 25 2025, 07:16 PM

Regular
******
Senior Member
1,030 posts

Joined: Jan 2022

Right to Be Forgotten
https://firstamendment.mtsu.edu/article/rig...o-be-forgotten/

QUOTE
The right to be forgotten is a legal concept recognized in the European Union and other parts of the world but a concept foreign and contrary to established First Amendment principles.

A commentator for The Guardian referred to the right to be forgotten as “the right to have an imperfect past.”

The push for “the right to be forgotten” comes from the idea that one’s prior misdeeds or acts of bad judgment should not come up on Google searches or other online search engines forever, that individuals ought to have the ability to remove negative references. This concept places tension between privacy and free expression.
Right to be forgotten recognized in Europe, but not United States

The Court of Justice of the European Union recognized the concept in a 2014 ruling involving a Spanish lawyer who sought to have online references to prior debt removed online.  Legal commentator McKay Cunningham explains that the decision “set a broad precedent, conferring a new legal right to force erasure of links to data on the Internet” (Cunningham, 496).

The right to be forgotten means that individuals have a right under certain circumstances to force search engines to remove links about them from the past. American courts do not recognize this concept.
Suit against Google for invasion of privacy dismissed

Consider the case of actress Cindy Lee Garcia, who responded to a casting call for an upcoming movie called Desert Warrior, an action-adventure film.  In her cameo role, Garcia spoke two sentences for a total of five seconds airtime. Unknowingly to her, the writer-director used her lines in a different film called Innocence of Muslims.  Film producers showed Garcia but dubbed over her originally spoken lines with “Is Your Mohammed a child molester?”  As a result of this dub, which was broadcast over YouTube, Garcia received death threats.


When will they allow user self account deletion/closure in this forum?
TShaya
post Aug 5 2025, 06:47 PM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003


Amid a crackdown on civil society groups in the USA, reporter Whena Owen meets Wellington-based digital infrastructure activist Julian Oliver, who is working with NGO and activist groups to move their most sensitive data out of US-friendly jurisdictions, and into places where it might be more secure.

But, as Oliver explains, that doesn’t include New Zealand.
TShaya
post Dec 13 2025, 07:58 PM

Sarawakian first!
*******
Senior Member
2,067 posts

Joined: Jan 2003


How can information stay available over time in a digital production world?

 

Change to:
| Lo-Fi Version
0.0390sec    1.37    5 queries    GZIP Disabled
Time is now: 17th December 2025 - 02:56 AM