21 curated breaking changes across major versions of charset_normalizer. Use this as a migration checklist before bumping dependencies.
UTF-7 will no longer appear as "detected" without a recognized SIG/mark (is unreliable/conflict with ASCII)
Breaking: Top-level function `normalize`
Breaking: Properties `chaos_secondary_pass`, `coherence_non_latin` and `w_counter` from CharsetMatch
Support for the backport `unicodedata2`
Breaking: Class aliases CharsetDetector, CharsetDoctor, CharsetNormalizerMatch and CharsetNormalizerMatches
Support for the backport `unicodedata2` **This is the last version (3.0.x) to support Python 3.6** _We plan to drop it for 3.1.x_
Breaking: Top-level function `normalize`
Breaking: Properties `chaos_secondary_pass`, `coherence_non_latin` and `w_counter` from CharsetMatch
UTF-7 will no longer appear as "detected" without a recognized SIG/mark (is unreliable/conflicts with ASCII)
Breaking: Top-level function `normalize`
Support for the backport `unicodedata2`
Coherence detector no longer return 'Simple English' instead return 'English'
Coherence detector no longer return 'Classical Chinese' instead return 'Chinese'
Breaking: Method `first()` and `best()` from CharsetMatch
UTF-7 will no longer appear as "detected" without a recognized SIG/mark (is unreliable/conflict with ASCII)
Breaking: Class aliases CharsetDetector, CharsetDoctor, CharsetNormalizerMatch and CharsetNormalizerMatches
Breaking: Properties `chaos_secondary_pass`, `coherence_non_latin` and `w_counter` from CharsetMatch
Breaking: Top-level function `normalize`
Breaking: Properties `chaos_secondary_pass`, `coherence_non_latin` and `w_counter` from CharsetMatch
Support for the backport `unicodedata2`
Breaking: Class aliases CharsetDetector, CharsetDoctor, CharsetNormalizerMatch and CharsetNormalizerMatches
Get this data programmatically \u2014 free, no authentication.
curl https://depscope.dev/api/breaking/pypi/charset_normalizer