Message 338388 - Python tracker

Message338388

Author	rhettinger
Recipients	Mariatta, dfrojas, eli.bendersky, lukasz.langa, matrixise, mdk, nedbat, rhettinger, scoder, serhiy.storchaka, sivert, taleinat, vstinner
Date	2019-03-19.16:58:02
SpamBayes Score	-1.0
Marked as misclassified	Yes
Message-id	<1553014682.17.0.534609190401.issue34160@roundup.psfhosted.org>
In-reply-to

Content
> how that's acceptable? For docutils, we'll most likely propose some variant of Stéphane Wirtel's script to test semantic equivalence for docutils. For other cases, Serhiy is working on a C14N canonicalization tool which is specifically designed for the task of creating reproducible output, in a cross-language standards compliant way. As Stefan Behnel clearly articulated, there are multiple reasons why Python should not guarantee byte-for-byte serialization across point releases. That said, we'll likely make the guarantee across micro-releases. That will make it possible a third mitigation strategy of generating new baseline files for a new point releases and adding a version check to decide which baseline to test against. FWIW, we had a similar discussion regarding hash randomization. While there are a number of significant differences, the outcome is relevantL User tests that depended on non-guaranteed implementation details had to be fixed.

Content

> how that's acceptable?

For docutils, we'll most likely propose some variant of Stéphane Wirtel's script to test semantic equivalence for docutils.  For other cases, Serhiy is working on a C14N canonicalization tool which is specifically designed for the task of creating reproducible output, in a cross-language standards compliant way.

As Stefan Behnel clearly articulated, there are multiple reasons why Python should not guarantee byte-for-byte serialization across point releases.  That said, we'll likely make the guarantee across micro-releases.  That will make it possible a third mitigation strategy of generating new baseline files for a new point releases and adding a version check to decide which baseline to test against.

FWIW, we had a similar discussion regarding hash randomization.  While there are a number of significant differences, the outcome is relevantL  User tests that depended on non-guaranteed implementation details had to be fixed.

History
Date	User	Action	Args
2019-03-19 16:58:02	rhettinger	set	recipients: + rhettinger, scoder, vstinner, taleinat, nedbat, eli.bendersky, lukasz.langa, serhiy.storchaka, matrixise, sivert, mdk, Mariatta, dfrojas
2019-03-19 16:58:02	rhettinger	set	messageid: <1553014682.17.0.534609190401.issue34160@roundup.psfhosted.org>
2019-03-19 16:58:02	rhettinger	link	issue34160 messages
2019-03-19 16:58:02	rhettinger	create