Linux – How to find out which Linux kernel versions are the most prevalent

linux-kernel

I am gearing up to do some scientific experiments that involve modifying the linux kernel to collect data on the internal runtime states of certain modules. We want to do the experiments over a handful of kernel versions that represent what's actually being used in real datacentres.

Question: How do I come up with the list of kernel versions to test? I can certainly pick some versions myself, but it would have more scientific credibility if there was some reference we could point to justify our choices. Even something informal like DistroWatch.com, but for kernel versions, would be helpful. Or possibly scraping download statistics from package managers of popular distributions, if that data is public.

[Note: I asked this first on ServerFault, but it was closed as "not relevant to systems administrators". Hopefully it's more on-topic here]

Best Answer

[Self answer]

While this is less than satisfying, we essentially went with @sjsam's advice and built a list of kernel versions by looking at the default kernel versions that ship with RedHat Enterprise Linux.

Looking at versions of RHEL that are still in support today (April 2016), this gives us the list:

  • 2.6.18
  • 2.6.32
  • 3.10.0
  • 4.X (just for good measure, I should test on whatever's current when we do the experiments)

I have no way of knowing how well this list represents what's "what's actually being used in real datacentres", but it's something. If we assume that datacentre engineers don't update their kernel from the default, then it's probably reasonably good.

Related Question