I keep getting errors while manually testing my decrypting function within the homework 6 program01.py program. My program produces correct keys for all the tests. The texts for test01 is correct in the output, and so are the first 4 lines of test02, however in character with index 244 it has 'â', '€', 'Š' which corresponds to the hex: e2 80 8a which corresponds to U+200A which is a "hair space":
https://www.utf8-chartable.de/unicode-utf8-table.pl?start=8192&number=128
I think this is because the test text is copied from the information theory wikipedia page (https://en.m.wikipedia.org/wiki/Information_theory ), and at this character index there is a superscript "vii" which was not taken out of the test text. Is this something we should be working around as an exercise or is this a mistake? It caused me to waste a lot of time to understand the issue, and although other people may have gotten around it, my algorithm didn't allow me to ignore this character.
https://www.utf8-chartable.de/unicode-utf8-table.pl?start=8192&number=128
I think this is because the test text is copied from the information theory wikipedia page (https://en.m.wikipedia.org/wiki/Information_theory ), and at this character index there is a superscript "vii" which was not taken out of the test text. Is this something we should be working around as an exercise or is this a mistake? It caused me to waste a lot of time to understand the issue, and although other people may have gotten around it, my algorithm didn't allow me to ignore this character.