Some installer tasks

Been poking about at MediaWiki, but not sure what to do? Here’s a few tasks that would help with some common problems for third-party users:

  • 1379: Install can’t find config/index.php
    Some hosting services put a “control panel” of some sort at the “/config” URL, making it difficult to get at the MediaWiki installer. Renaming this to something more unique, and providing a compatibility link for convenience, would not be very hard but would help people stuck on this sort of host.
  • 9954: Detect “extra whitespace” / BOM conditions
    PHP is very picky about extra whitespace at the start and end of source files. Unfortunately it’s not uncommon for people to end up with extra blank lines or a hidden Unicode BOM sequence at the start or end of files they’ve customized. This leads to weird, hard to diagnose problems like cookies not getting set or RSS feeds that break with little explanation. Some software support to detect this and report which file is broken (and how to fix it!) would be very helpful.
  • 10387: Detect and handle ‘.php5’ extension environments
    More and more hosting services are providing PHP 5.x, but some are putting it alongside existing PHP 4.x services, requiring that files be named with a .php5 extension. With a little care, the installer could detect this out of the box and set things up to work on such systems with few problems.
    Update 2007-06-28: Edward Z. Yang whipped up a good patch for this, which I’ve commited to trunk.

Wikimedia page views

For the curious, here’s some statistics I whipped up the other day from our sampled logs. These are counts of plain article page views on all of Wikimedia’s wikis, from about three days’ worth of a 1/1000-sampled log. Images, style sheets, edit actions, diffs, special pages, etc are excluded.

I’ve broken it down by referrer, with some rough groupings. It’s not very scientific, but some might find it interesting. :)

We should be able to process similar statistics on an automated basis from the log server now that we’ve got it set up, including breakdowns by site and by language.

Referrer Samples Percentage Daily extrapolation Monthly extrapolation
total 71873 100.0000 233,610,232 7,008,306,975
in-wiki link 31512 43.8440 102,424,076 3,072,722,293
Google 17102 23.7947 55,586,969 1,667,609,059
no referrer 11873 16.5194 38,591,047 1,157,731,397
IE7 gadget 3862 5.3734 12,552,735 376,582,048
other Wikimedia 2174 3.0248 7,066,195 211,985,855
in-wiki search 2140 2.9775 6,955,684 208,670,529
Yahoo 1507 2.0968 4,898,232 146,946,957
other 1162 1.6167 3,776,872 113,306,147
MSN 208 0.2894 676,067 20,281,995
Live 189 0.2630 614,310 18,429,313
AOL 84 0.1169 273,027 8,190,806
Ask 39 0.0543 126,762 3,802,874
AltaVista 21 0.0292 68,257 2,047,701