learned and not forgotten

windows 1252 encoding

CP1252 is windows encoding

em dash is a measurement of font size that is often double encoded?
to find it do:
zcat file | grep -P '\xc2\x96'

91,92,93,94 are also other troublesome windows chars




Labels: encoding
Newer Post Older Post Home

Labels

command line (7) glossary (4) linguistics (4) Perl (2) encoding (2) health (2) pregnancy (2) python (2) terminology (2) this blog (2) awk (1) curiosity (1) data struct + algorithms (1) education (1) genealogy (1) grep (1) scripting (1) sed (1)

Blog Archive

  • ►  2014 (1)
    • ►  April (1)
  • ►  2013 (6)
    • ►  October (1)
    • ►  July (4)
    • ►  April (1)
  • ▼  2012 (28)
    • ►  December (1)
    • ►  September (1)
    • ►  July (4)
    • ►  June (1)
    • ►  May (4)
    • ▼  April (5)
      • free online courses
      • using "screen" command to handle multiple sessions
      • awk
      • iconv and recode to change file encoding
      • windows 1252 encoding
    • ►  March (2)
    • ►  February (9)
    • ►  January (1)
  • ►  2011 (1)
    • ►  June (1)
Simple theme. Powered by Blogger.