(→Coming features: oops, MW's API cannot tell you which revision a URL was added in -- that requires a binary search of all page revisions -- but this IA API meets our needs on its own)
m (→Coming features: wording)
Line 38:
*Detection of "external internal" links, which occur when someone puts the full URL to another page on this same wiki, like <nowiki>"[http://wiki.oni2.net/OniGalore:Current_events]"</nowiki> instead of <nowiki>"[[OniGalore:Current events]]"</nowiki>.
*Detection of external interwiki links, where the editor fails to take advantage of an [[Help:Editing#Interwiki_links|interwiki prefix]] that would have made their link shorter and more resistant to rot.
*It can be tedious to find a valid version of an old page on the Internet Archive by browsing in the Wayback Machine. Fortunately, the Archive offers an API for finding valid snapshots, built for use by Wikipedia's [[metawikipedia:InternetArchiveBot|InternetArchiveBot]]. [http://archive.org/wayback/available?url=http://www.pbs.org/wnet/religionandethics/week622/hedges.html&statuscodes=200&statuscodes=203&statuscodes=206 Here] is a sample query that passes a URL to the Archive and asks for the latest snapshot where the server holding the original page returned an OK code to the Archive's web crawler. Note that a server returning "OK" does not guarantee that the page it returned actually has the desired content; see the next point.
*At a cursory glance, I can see that many external links no longer display the content that they were intended to display. In many cases, web sites silently redirect the user to their main page without returning the status code that indicates the content was not found. Only visual inspection of the pages can catch these issues. Once we have dealt with the low-hanging fruit of pages that return NG codes, the screenshot feature in the script will be activated, which will allow us to easily confirm whether the "OK" links (and "OK" snapshots from the Internet Archive) are actually loading the intended page.
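The Internet Archive query described above can be sketched in a few lines of Python. This is only an illustration, not the script's actual code: the function names are hypothetical, and the response layout (an <code>archived_snapshots</code> object holding a <code>closest</code> entry) is assumed from the Archive's availability API as commonly documented. Unlike the sample link, this sketch percent-encodes the page URL in the query string.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint and status codes taken from the sample query in the text.
AVAILABILITY_ENDPOINT = "http://archive.org/wayback/available"
OK_CODES = (200, 203, 206)


def build_query(page_url, status_codes=OK_CODES):
    """Build the availability query URL for one external link (hypothetical helper)."""
    params = [("url", page_url)]
    params += [("statuscodes", str(code)) for code in status_codes]
    return AVAILABILITY_ENDPOINT + "?" + urlencode(params)


def closest_snapshot(response_text):
    """Return the snapshot URL from a JSON response, or None if none is listed.

    Assumes the response looks like:
    {"archived_snapshots": {"closest": {"available": true, "url": "..."}}}
    """
    data = json.loads(response_text)
    closest = data.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest.get("url")
    return None


def find_snapshot(page_url, timeout=30):
    """Ask the Archive for a snapshot of page_url (performs a network request)."""
    with urlopen(build_query(page_url), timeout=timeout) as resp:
        return closest_snapshot(resp.read().decode("utf-8"))
```

A call such as <code>find_snapshot("http://www.pbs.org/wnet/religionandethics/week622/hedges.html")</code> would then return a snapshot URL or <code>None</code>. As noted above, a returned snapshot only means the original server answered with an OK code, not that the page holds the intended content.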
==PlayStation 2 port==