Millions of broken Wikipedia links fixed


Nine million broken Wikipedia links have been fixed thanks to an alliance with the Internet Archive.

The online encyclopedia’s editors have long been encouraged to provide links to web-based sources. But the details can be lost if the third-party sites close or update their pages.

To address this, visitors are now pointed to snapshots of what the sites used to show, when required.

It has also emerged that some editors voted to restrict use of Breitbart.

A post on Wikipedia’s reliable sources page states that there was a “very clear consensus” that the right-wing news site should stop being used as a source for facts “due to its unreliability”.

It suggested that Breitbart could still, however, be used to attribute viewpoints.

But some editors were concerned by the idea.

“Breitbart should be used with caution – but an outright ban on citing it would hurt Wikipedia far more than help,” wrote one.

The BBC has contacted Breitbart for a response.

It follows a similar move against the Daily Mail last year.

Volunteers were subsequently encouraged to review existing Wikipedia citations of the UK newspaper and either remove or replace them.

The Motherboard news site has reported that Wikipedia editors have also advocated similar limits on the use of articles by the left-wing Occupy Democrats organisation and the conspiracy-theory media platform InfoWars.

Link rot

The collaboration with the Internet Archive makes use of the San Francisco-based project’s Wayback Machine tool.

This allows users to enter a web address and then find stored versions of how a page appeared on dates in the past.

The non-profit said it had begun using automated software three years ago to hunt out links that resulted in “page not found” or “404” and “500” errors.

This bot then searched the Wayback Machine for the relevant information and automatically updated the links.
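The article does not describe the bot's actual code, so the following is only a rough sketch of the check-and-replace idea, written in Python. It assumes the Internet Archive's public availability endpoint (`https://archive.org/wayback/available`) and its JSON reply format; the function names, the `lookup` parameter and the choice to treat only 404 and 500 as broken are illustrative assumptions, not details of the real bot.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Status codes treated as "broken" here: the two named in the article.
BROKEN_STATUSES = {404, 500}

# Assumed endpoint: the Wayback Machine's public availability API.
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def is_broken(status_code):
    """Return True if the HTTP status marks the link as dead."""
    return status_code in BROKEN_STATUSES


def availability_url(page_url, timestamp):
    """Build the query asking for the snapshot closest to a YYYYMMDD date."""
    query = urlencode({"url": page_url, "timestamp": timestamp})
    return AVAILABILITY_ENDPOINT + "?" + query


def closest_snapshot(response):
    """Pull the archived URL out of a parsed availability reply, if any."""
    closest = response.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest.get("url")
    return None


def fetch_availability(page_url, timestamp):
    """Ask the archive which snapshot it holds (performs a network request)."""
    with urlopen(availability_url(page_url, timestamp), timeout=10) as reply:
        return json.load(reply)


def repair_link(page_url, status_code, cited_on, lookup=fetch_availability):
    """Return an archived URL for a dead link, or the original link otherwise.

    `lookup` is injectable so the logic can be exercised without the network.
    """
    if not is_broken(status_code):
        return page_url
    snapshot = closest_snapshot(lookup(page_url, cited_on))
    return snapshot or page_url
```

Passing the date on which the source was cited as `cited_on` asks for the snapshot nearest to what the editor originally saw. If the archive holds no copy, the sketch leaves the link unchanged rather than guessing.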

This, the archive’s director said, had resulted in six million pages lost to “link rot” being restored.


Mark Graham added that members of the Wikipedia community had also helped tackle a related issue – “content drift”.

This occurs when a page remains online but its text and images change so that they no longer resemble what the editor who linked to them had intended.

These human volunteers had fixed more than three million links to date, the director wrote.

“We will expand our efforts to check and edit more Wikipedia sites and increase the speed which we scan those sites and fix broken links,” Mr Graham concluded, adding that he also intended to explore whether Wikipedia’s contributors could be encouraged to use Wayback Machine snapshots in the first place rather than live-web links.




