kernellogger,
@kernellogger@fosstodon.org avatar

Jeremy Allison writes:

'" The data shows that “frozen” vendor kernels, created by branching off a release point and then using a team of engineers to select specific patches to back-port to that branch, are buggier than the upstream “stable” Linux created by Greg Kroah-Hartman. '"

https://ciq.com/blog/why-a-frozen-linux-kernel-isnt-the-safest-choice-for-security/

jejb,
@jejb@mastodon.online avatar

@kernellogger I'm afraid I can't support the counting methodology in the paper either. Besides the not applicable because of config issues RH people cite, there's also the fact that not everything that has a cc: stable tag is an exploitable bug. Plus every fix backported carries risk (just look at the number of regressions in stable due to backports) so that risk has to be set against the benefit of the backport. A general rule would be if it's not exploitable don't backport it.

larsmb,
@larsmb@mastodon.online avatar

@kernellogger That's to be expected, but it is also not the point of them.
I agree they shouldn't need to exist, but the realities of how many many an organization manages their IT necessitates their existence.
The industry doesn't want to go through the withdrawal phase of building a better world.

bluca, (edited )
@bluca@fosstodon.org avatar

@kernellogger as usual, the point is not that these are bug free, but that they are regression free. The kernel upstream releases break userspace on every new release, and kernel maintainers don't care. See https://github.com/torvalds/linux/commit/a1912f712188291f9d7d434fba155461f1ebef66 for example, as Daan just found out, which removed a mount option without caring that it is still being used, so since 6.8 every btrfs device can no longer be mounted by systemd

kernellogger,
@kernellogger@fosstodon.org avatar

@bluca

Well, to claim "kernel maintainers don't care" you have to at least report the bug to them[1]. That afaics has not happened yet (or I could not find it).

"since 6.8 every btrfs device can no longer be mounted by systemd": then why was this only noticed 2+ months after a release with that commit went out? This raises the question: what kind of problem did users actually run into?

[1] yes, sure, ideally they would have done a code search first, but we are all imperfect…

bluca,
@bluca@fosstodon.org avatar

@kernellogger well, the kernel doesn't have a bug tracker - not for real anyway, bugzilla.kernel.org might as well be pointed to /dev/null, so no idea what "reporting" would even mean in this case. I do not use BTRFS so I am not affected, just sharing what was reported to me. It looks like it was reported against the Debian kernel package too now: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1071420

kernellogger,
@kernellogger@fosstodon.org avatar

@bluca

reg. bug reporting:

https://docs.kernel.org/admin-guide/reporting-issues.html

https://docs.kernel.org/admin-guide/reporting-regressions.html

Some of it does not apply in this case.

I also make sure to handle regressions that are submitted to bugzilla.kernel.org

felix,
@felix@misskey.io avatar

@kernellogger @bluca but, following upstream, passthrough patches, and maintain downstream compatibility, is the duty of a middleware, like systemd, from kernel to user interface, no?

kernellogger,
@kernellogger@fosstodon.org avatar

@felix @bluca

duty? no!

nice behaviour, cooperative, up to some point kinda expected, and what most people do: sure.

kernellogger,
@kernellogger@fosstodon.org avatar

@bluca

and thx for the link to the debian bug tracker; but I want to see more details first what went wrong there, as I'd expect it would be pretty unlikely that this is the first debian btrfs user that updated to 6.8 or higher; so why did it break for that user, but apparently not for the others?

bluca,
@bluca@fosstodon.org avatar

@kernellogger 6.8 has just arrived in Debian unstable 3 days ago: https://tracker.debian.org/pkg/linux

bluca,
@bluca@fosstodon.org avatar
kernellogger, (edited )
@kernellogger@fosstodon.org avatar

@bluca

thx, yeah, I already have been watching that.

1/ FWIW, I think you owe the kernel developers an apology, as you made a lot of noise and claimed "kernel maintainers don't care", when they clearly do once the problem was properly reported -- and quite quickly even. And yes, sure, in the ideal world they would have cared some more and performed a code-search before removing this option to prevent it in the first place. But we are all imperfect and make mistakes. Same for @pid_eins, who…

kernellogger, (edited )
@kernellogger@fosstodon.org avatar

@bluca @pid_eins

2/ …wrote "And my main beef here is that they claim they wouldnt do it ever..."[1], as that is not even true. They often try changes or removals to see if it breaks something – and if it does, it's reverted. Even the removal of the support for the original i386 was handled like that by Linus himself.

[1] https://mastodon.social/

pid_eins,
@pid_eins@mastodon.social avatar

@kernellogger @bluca sure, but then the rule is not "we never break userspace" but more "move fast and break things, and sometimes revert where people protest too loudly".

I mean, that's fine by me, but maybe they should communicate it like that then.

The thing is that removing a widely documented mount option is very obviously a compat breakage. You cannot discount that. It's not just a "mistake" to remove something like that, it's an obvious attempt to break compat.

kernellogger,
@kernellogger@fosstodon.org avatar

@pid_eins @bluca

the exact relevant rule is just "no regressions", nothing more. Everything else is just left to the interpretation by people (in typical Linus manner you might say). 🥴

pid_eins,
@pid_eins@mastodon.social avatar

@kernellogger @bluca

Actually, the exact relevant rule is "WE DO NOT BREAK USERSPACE", all in uppercase.

https://lkml.org/lkml/2012/12/23/75

I find the sound of that mail quite different from your much weaker "let's maybe undo the worst shit if people complain too loudly"... And of course "uh, sometimes we fucked up so hard, we cannot fix it anymore, let's add a new api instead" (which is what happened in the block device capabilities/media change api).

pid_eins,
@pid_eins@mastodon.social avatar

@kernellogger @bluca

(again, I actually find it OK if API is broken from time to time, just be honest about it, and communicate properly, and do a bit of research first. Don't claim that uppercase extremism and then do not even superficially follow through)

kernellogger,
@kernellogger@fosstodon.org avatar

@pid_eins @bluca

hmmm:

$ grep -ri 'no regressions' Documentation/ | wc -l
13

$ grep -ri 'not break userspace' Documentation/ | wc -l
0

Also:

"WE DO NOT BREAK USERSPACE": 2 hits – https://lore.kernel.org/all/?q=f%3ATorvalds+%22WE+DO+NOT+BREAK+USERSPACE%22

"no regresssions": 44 hits –https://lore.kernel.org/all/?q=f%3ATorvalds%20%22no%20regressions%22

SchwarzeLocke,
@SchwarzeLocke@ohai.social avatar

@kernellogger @pid_eins My impression, having more of an outside perspective and working with higher level languages: should deprecations perhaps always be gated with a config flag, perhaps even a common one similar to BROKEN?

With Java/Scala, it's always quite clear for me where deprecated methods are used. Also I can have builds fail due to that or not, so that I notice new deprecations when building / in CI.

kernellogger,
@kernellogger@fosstodon.org avatar

@SchwarzeLocke @pid_eins

there are various things that can work and I guess it depends on the situation what reasonable and effective.

For the kernel I something think "add delays (together with a msg in the logs) that grow longer and longer over time when people use deprecated stuff, at some point people get curious and will investigate" might be something that might help, OTOH it's a kind of stupid idea 😂

pavel,

@kernellogger a) distros want no-regressions, not no-bugs. b) -stable people are CVE authority.

kernellogger,
@kernellogger@fosstodon.org avatar

@pavel

reg. the "distros want no-regressions, not no-bugs":

from my point of view the whole situation could be a lot better if distros would spend some of the money they currently invest in CI instead invest in working on workflow improvements and some others stuff to ensure regressions do not happen in the first place or are quickly resolved.

thefossguy,
@thefossguy@fosstodon.org avatar

@kernellogger @pavel I’d be interested in knowing how you would improve the workflows. What’s missing, what can be improved and what shouldn’t be done. I would love to help with this however I can. :)

kernellogger,
@kernellogger@fosstodon.org avatar

@thefossguy @pavel

There is no easy answer here, as it are lots of details; but there is a decent chance I need to write this up soon anyway; if I do, I'll get back to you!

vegard,
@vegard@mastodon.social avatar

@kernellogger On the other hand, what's the difference between a distro branching off and backporting stuff from mainline and upstream stable branching off and backporting stuff from mainline? Why can the upstream stable maintainers do this and "a team of engineers" cannot? I think the difference could be better characterized and (if you pardon the expression) makes all the difference.

gregkh,

@vegard @kernellogger "a team of engineers" COULD do that (hint, Android does, by just taking the stable updates), but the paper shows that a specific "team of engineers" currently does NOT do that, which puts the users of those kernels potentially at a greater risk.

Read the paper, it's interesting.

Disclosure, I read it before it was published, but had no influence on it at all.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • linux
  • khanakhh
  • DreamBathrooms
  • mdbf
  • tester
  • ngwrru68w68
  • magazineikmin
  • cubers
  • InstantRegret
  • rosin
  • Youngstown
  • slotface
  • everett
  • kavyap
  • GTA5RPClips
  • megavids
  • osvaldo12
  • Leos
  • tacticalgear
  • normalnudes
  • thenastyranch
  • ethstaker
  • Durango
  • modclub
  • provamag3
  • anitta
  • cisconetworking
  • JUstTest
  • lostlight
  • All magazines