1953: "The History of Unicode"

This forum is for the individual discussion thread that goes with each new comic.

Moderators: Moderators General, Prelates, Magistrates

User avatar
cellocgw
Posts: 1916
Joined: Sat Jun 21, 2008 7:40 pm UTC

Re: 1953: "The History of Unicode"

Postby cellocgw » Mon Feb 12, 2018 1:27 pm UTC

da Doctah wrote:
Mikeski wrote:
jc wrote:I know a few who like to call themselves Mainiacs.

The only former-Bostonian I've worked with called himself a Masshole.

Kinda makes me wish there was a good self-deprecating term for "Minnesotan". Other than "Canadian".


I think you mean "Baja Canadian".


as opposed to, maybe, "Bwahaha Canadian" ?
https://app.box.com/witthoftresume
Former OTTer
Vote cellocgw for President 2020. #ScienceintheWhiteHouse http://cellocgw.wordpress.com
"The Planck length is 3.81779e-33 picas." -- keithl
" Earth weighs almost exactly π milliJupiters" -- what-if #146, note 7

Godsguy
Posts: 21
Joined: Thu Mar 28, 2013 2:10 pm UTC

Re: 1953: "The History of Unicode"

Postby Godsguy » Mon Feb 12, 2018 1:42 pm UTC

ivnja wrote:
Eternal Density wrote:(What's the word for people who live in Maine?)
Mainers, although I don't think there's technically an official demonym.

jc wrote:I know a few who like to call themselves Mainiacs.
And then there's the ultra-special breed, the Maineiacs, who you'll find screaming abuse at opposing goaltenders from the student section balcony at the Alfond every other Friday and Saturday night from mid-October to mid-February. (Go Blue!)



While the rest of the country might spell/pronounce it "Mainer", the true name is closer to "Mainah".

You people and your R's. Leave em at the bordah.....
There is nothing, absolutely nothing, half so much worth doing as simply messing around in boats - Ken Grahame (Wind in the Willows)

ebow
Posts: 36
Joined: Wed Oct 03, 2012 4:16 pm UTC
Location: Chair

Re: 1953: "The History of Unicode"

Postby ebow » Mon Feb 12, 2018 2:47 pm UTC

Godsguy wrote:While the rest of the country might spell/pronounce it "Mainer", the true name is closer to "Mainah".

You people and your R's. Leave em at the bordah.....


I'd like to see some dater to support the idear that R's need not apply.

(Or is that only the case in mainland Massachusetts, not in the District of Maine?)

Shifty
Posts: 7
Joined: Sun Aug 18, 2013 7:24 pm UTC

Re: 1953: "The History of Unicode"

Postby Shifty » Tue Feb 13, 2018 4:39 pm UTC

J%r wrote:Did the image change? Somehow I only see the top rectangle.


When the cartoon was originally posted, the date on the left of the third panel was "1998" instead of "1988". I suspect correcting that was the cause of the momentary glitching.

User avatar
jc
Posts: 345
Joined: Fri May 04, 2007 5:48 pm UTC
Location: Waltham, Massachusetts, USA, Earth, Solar System, Milky Way Galaxy
Contact:

Re: 1953: "The History of Unicode"

Postby jc » Tue Feb 13, 2018 6:02 pm UTC

Godsguy wrote:While the rest of the country might spell/pronounce it "Mainer", the true name is closer to "Mainah".

You people and your R's. Leave em at the bordah.....


But an R seems to have crept in somehow. It should be "boadah".

xtifr
Posts: 327
Joined: Wed Oct 01, 2008 6:38 pm UTC

Re: 1953: "The History of Unicode"

Postby xtifr » Thu Feb 15, 2018 8:58 pm UTC

So yes, why does the forum block random unicode characters like Egyptian Hieroglyphs? I can understand blocking things like the bidi control characters, and I can maybe understand blocking the penis hieroglyph (and the ejaculating penis heiroglyph), but the whole code block? That's just bizarre.

In any case, though I can't post it (and that may be just as well), I do want to remind aubergine users that there are perfectly good unicode characters for the penis itself: U+130B8 and U+130BA. Not as colorful as the aubergine, perhaps, but much less ambiguous.
"[T]he author has followed the usual practice of contemporary books on graph theory, namely to use words that are similar but not identical to the terms used in other books on graph theory."
-- Donald Knuth, The Art of Computer Programming, Vol I, 3rd ed.

User avatar
Soupspoon
You have done something you shouldn't. Or are about to.
Posts: 3503
Joined: Thu Jan 28, 2016 7:00 pm UTC
Location: 53-1

Re: 1953: "The History of Unicode"

Postby Soupspoon » Thu Feb 15, 2018 10:40 pm UTC

xtifr wrote:So yes, why does the forum block random unicode characters like Egyptian Hieroglyphs? I can understand blocking things like the bidi control characters, and I can maybe understand blocking the penis hieroglyph (and the ejaculating penis heiroglyph), but the whole code block? That's just bizarre.

Whitelisting?

Start with selected ASCII codes lower than 32 (CR/LF accepted but standardised, no BELL?) and the whole of the 32 to 126 range, then expand into allowing long-accepted Codepage/Unicode territories, certainly, but most of the Emoji pages just lower the information density.

Like I often cannot identify elements in a 'Graphics Pack'ed Dwarf Fortress screenshot, because that little (exquisitely drawn!) pictorial array of pixels makes less sense to me than a given colour of Codepage-437 character image, most of the time.

Words, though. They work well enough, most of the time, with a good application of punctuation, a limited number of standardised in-line smiley images, proper mark-up and faux mark-up ([explanation]maybe for irony[/explanation]).

Sending me an aubergine symbol would still confuse me, though I have been aware of its popular symbology (a minced reference, to get around penis-symbol blacklisting, no doubt).

Sending me the 1F64F "PERSON WITH FOLDED HANDS symbol (which you can't do, here, but if you could I'd advise you to drastically increase the font-size for me to have a hope to identify it without some form of pinch-zooming) confuses me more as it looks more like a "deep in prayer" symbol. Is that what you meant? Apparently Randall used it as "thank you", in the comic title-text example. But "Thank God! (for helicopter heroes? - though I don't see the helicopter or medal symbols myself)" might be the message I take away, which is quite a different meaning from most of the various possibilities a sender might intend.

(Heck, even some of the authorised in-line smiley symbols are ambiguous. :| )

User avatar
ucim
Posts: 6410
Joined: Fri Sep 28, 2012 3:23 pm UTC
Location: The One True Thread

Re: 1953: "The History of Unicode"

Postby ucim » Thu Feb 15, 2018 11:30 pm UTC

Problem with Unicode and Emojis is that they don't go far enough. We should have a single symbol for every possible internet thought. Think of all the typing it would save!

(Upon further reflection, we can probably have a symbol for every possible internet thought just using the upper case ASCII character set.)

Jose
Order of the Sillies, Honoris Causam - bestowed by charlie_grumbles on NP 859 * OTTscar winner: Wordsmith - bestowed by yappobiscuts and the OTT on NP 1832 * Ecclesiastical Calendar of the Order of the Holy Contradiction * Please help addams if you can. She needs all of us.

User avatar
chridd
Has a vermicelli title
Posts: 822
Joined: Tue Aug 19, 2008 10:07 am UTC
Location: ...Earth, I guess?
Contact:

Re: 1953: "The History of Unicode"

Postby chridd » Mon Feb 19, 2018 5:38 pm UTC

xtifr wrote:So yes, why does the forum block random unicode characters like Egyptian Hieroglyphs? I can understand blocking things like the bidi control characters, and I can maybe understand blocking the penis hieroglyph (and the ejaculating penis heiroglyph), but the whole code block? That's just bizarre.
It has problems with any character with a code above U+FFFF, i.e., any character that's more than two bytes in UTF-16 (which include, among other things, hieroglyphs and most emoji). (...which is odd, because an older version of the forum software allowed them just fine, and last I checked old posts that have the characters still display okay.)
~ chri d. d. /tʃɹɪ.di.di/ (Phonotactics, schmphonotactics) · she(?)(?(?)(?))(?(?(?))(?))(?) · Forum game scores
mittfh wrote:I wish this post was very quotable...
flicky1991 wrote:In both cases the quote is "I'm being quoted too much!"
chridd (on Discord) wrote:
Dummy wrote:Sorry You're Gay Dads
SYG'D

User avatar
Soupspoon
You have done something you shouldn't. Or are about to.
Posts: 3503
Joined: Thu Jan 28, 2016 7:00 pm UTC
Location: 53-1

Re: 1953: "The History of Unicode"

Postby Soupspoon » Mon Feb 19, 2018 6:07 pm UTC

I say that "It has problems with…" is probably more like "It has been told to refuse…". A filter that did not exist in prior forum backends, perhaps, thus messages already featuring them still do1 but the current generation of the backend is (with possibly good reasons) set up to reject characters that are technically passable, but blanket banned either as default (not unbanned, locally) or as a chosen installation option by our senior moderator(s).

And that's that, I think, until this is brought up in the forum-admin areas (public or otherwise) and the option to allow it is posited, indicated as possible and then made so by those with sufficient bit-flipping rights on the server. Not worth arguing about here, I would say (unless someone with coloured text cares to just pass by here say that this is not the case, and it's no use petitioning anybody to change something that isn't doable in the first place, for far more complex reasons than I have been guessing about).


1 OTOH, I note many "unavailable image" inserts, without having checked where it thinks the image is, maybe an obsolete 3rd-party image-store site but it seems to be consistent among early years posts all over the place so they might be once-hosted-on-here ones that didn't get ported (or not properly so).

User avatar
jc
Posts: 345
Joined: Fri May 04, 2007 5:48 pm UTC
Location: Waltham, Massachusetts, USA, Earth, Solar System, Milky Way Galaxy
Contact:

Re: 1953: "The History of Unicode"

Postby jc » Thu Feb 22, 2018 11:09 pm UTC

xtifr wrote:So yes, why does the forum block random unicode characters like Egyptian Hieroglyphs? I can understand blocking things like the bidi control characters, and I can maybe understand blocking the penis hieroglyph (and the ejaculating penis heiroglyph), but the whole code block? That's just bizarre.

In any case, though I can't post it (and that may be just as well), I do want to remind aubergine users that there are perfectly good unicode characters for the penis itself: U+130B8 and U+130BA. Not as colorful as the aubergine, perhaps, but much less ambiguous.

For anyone who's curious, you can find those characters at http://www.unicode.org/charts/PDF/U13000.pdf .
.

User avatar
ucim
Posts: 6410
Joined: Fri Sep 28, 2012 3:23 pm UTC
Location: The One True Thread

Re: 1953: "The History of Unicode"

Postby ucim » Thu Feb 22, 2018 11:39 pm UTC

... and where can I find a list of unicode characters with (potentially bad) side effects on web pages, browsers, and other user agents?

Jose
Order of the Sillies, Honoris Causam - bestowed by charlie_grumbles on NP 859 * OTTscar winner: Wordsmith - bestowed by yappobiscuts and the OTT on NP 1832 * Ecclesiastical Calendar of the Order of the Holy Contradiction * Please help addams if you can. She needs all of us.

User avatar
Soupspoon
You have done something you shouldn't. Or are about to.
Posts: 3503
Joined: Thu Jan 28, 2016 7:00 pm UTC
Location: 53-1

Re: 1953: "The History of Unicode"

Postby Soupspoon » Fri Feb 23, 2018 12:45 am UTC

Well, there'd be a potential list for one security issue, that I'm not even sure that we are protected from. Or could/should be, because there are legitimate reasons for using those сhагасtегѕ. And that's just an obvious exploit. I never actually typed the plural of "character", in this post, and I made not as much effort to be subtle as I could have (you probably spotted it). Imagine that instead I was messing about with RTL and other non-printing codes that are 'allowable'?

There are people whose whole paycheque (legitimate or illicit) heavily depends upon thinking these things through. Or responding to them, hence a good part of the CVEs thing. (It was been a sideline/distraction, at most, even when I was ostensibly employed as consultant in IT security (they changed my job name, but not my workload, which I was Ok with), so I let the Big Guys earn their keep without professing anything like a similar level of expertise, or claiming any particularly significqnt hat (of any greyscale value) for myself.)

User avatar
Steve the Pocket
Posts: 688
Joined: Mon Apr 23, 2007 4:02 am UTC
Location: Going downtuuu in a Luleelurah!

Re: 1953: "The History of Unicode"

Postby Steve the Pocket » Fri Feb 23, 2018 7:29 am UTC

jc wrote:
xtifr wrote:So yes, why does the forum block random unicode characters like Egyptian Hieroglyphs? I can understand blocking things like the bidi control characters, and I can maybe understand blocking the penis hieroglyph (and the ejaculating penis heiroglyph), but the whole code block? That's just bizarre.

In any case, though I can't post it (and that may be just as well), I do want to remind aubergine users that there are perfectly good unicode characters for the penis itself: U+130B8 and U+130BA. Not as colorful as the aubergine, perhaps, but much less ambiguous.

For anyone who's curious, you can find those characters at http://www.unicode.org/charts/PDF/U13000.pdf .
.

What the hell.

Forget emojis; why in Dickens's deuce is this a thing?!
cephalopod9 wrote:Only on Xkcd can you start a topic involving Hitler and people spend the better part of half a dozen pages arguing about the quality of Operating Systems.

Baige.

User avatar
Flumble
Yes Man
Posts: 2024
Joined: Sun Aug 05, 2012 9:35 pm UTC

Re: 1953: "The History of Unicode"

Postby Flumble » Fri Feb 23, 2018 12:06 pm UTC

Steve the Pocket wrote:Forget emojis; why in Dickens's deuce is this a thing?!

You know what the "uni" in unicode stands for, right? Egyptian hieroglyphs have (allegedly) been used as a script by a whole civilization.
It makes much more sense for those to be included in unicode than, say, emoji ...let alone things like skin tone modifiers. U+130B8 U+1F3FE U+1F93F U+0356

User avatar
orthogon
Posts: 2939
Joined: Thu May 17, 2012 7:52 am UTC
Location: The Airy 1830 ellipsoid

Re: 1953: "The History of Unicode"

Postby orthogon » Fri Feb 23, 2018 2:13 pm UTC

jc wrote:
xtifr wrote:So yes, why does the forum block random unicode characters like Egyptian Hieroglyphs? I can understand blocking things like the bidi control characters, and I can maybe understand blocking the penis hieroglyph (and the ejaculating penis heiroglyph), but the whole code block? That's just bizarre.

In any case, though I can't post it (and that may be just as well), I do want to remind aubergine users that there are perfectly good unicode characters for the penis itself: U+130B8 and U+130BA. Not as colorful as the aubergine, perhaps, but much less ambiguous.

For anyone who's curious, you can find those characters at http://www.unicode.org/charts/PDF/U13000.pdf .
.

I am disappoint that the name of that codepoint is "EGYPTIAN HIEROGLYPH D053" not "EGYPTIAN HIEROGLYPH EJACULATING PENIS". It's like they noticed that was one of the hieroglyphs, and prudishly decided to call them all by their codepoints in some other system to spare their blushes. If it's good enough for hieroglyphs, why aren't the Roman letters called things like "ASCII CHARACTER 41"?
xtifr wrote:... and orthogon merely sounds undecided.

User avatar
Steve the Pocket
Posts: 688
Joined: Mon Apr 23, 2007 4:02 am UTC
Location: Going downtuuu in a Luleelurah!

Re: 1953: "The History of Unicode"

Postby Steve the Pocket » Fri Feb 23, 2018 8:12 pm UTC

Flumble wrote:
Steve the Pocket wrote:Forget emojis; why in Dickens's deuce is this a thing?!

You know what the "uni" in unicode stands for, right? Egyptian hieroglyphs have (allegedly) been used as a script by a whole civilization.

A dead civilization. It's not like we're leaving modern-day ancient Egyptians without a way to communicate on their phones. And I've never seen anyone use hieroglyphics in print, only ever just photographs of tomb inscriptions. Maybe a monochrome diagram from time to time, on its own line, but you can digitize that without custom characters.
cephalopod9 wrote:Only on Xkcd can you start a topic involving Hitler and people spend the better part of half a dozen pages arguing about the quality of Operating Systems.

Baige.

xtifr
Posts: 327
Joined: Wed Oct 01, 2008 6:38 pm UTC

Re: 1953: "The History of Unicode"

Postby xtifr » Sat Feb 24, 2018 8:09 pm UTC

Steve the Pocket wrote:
Flumble wrote:You know what the "uni" in unicode stands for, right? Egyptian hieroglyphs have (allegedly) been used as a script by a whole civilization.

A dead civilization.


A dead civilization which is the subject of tons of research and which regularly generates scholarly papers. The fact that you may only read popularizations which show photos of tombs rather than going into detail about the language doesn't mean there aren't regularly published papers about the language, or which quote the language in detail.

Heck, Egyptology is such a prominent branch of archaeology that it has its own name.

But ancient Egyptian isn't the only dead language covered by Unicode. Unicode is big! It has Cuneiform and Ogham and Runic as well. And plenty of room to spare.

There's a movement to get Tolkien's Tengwar (Elvish) accepted. Some folks have even staked out a code block and produced unofficial fonts using that block. So far, the Unicode Consortium have balked at accepting any made-up languages, but there's enough scholarship studying Tolkien that they could end up conceding on this one. For the moment, though, they're still limiting it to real languages. But that is pretty much the only limit so far.

Unicode isn't designed just to allow millennials in the US and China to send each other pictures of aubergines. It's designed to be useful for grownups as well! :P :wink:

edit: fix typo
"[T]he author has followed the usual practice of contemporary books on graph theory, namely to use words that are similar but not identical to the terms used in other books on graph theory."
-- Donald Knuth, The Art of Computer Programming, Vol I, 3rd ed.

rmsgrey
Posts: 3412
Joined: Wed Nov 16, 2011 6:35 pm UTC

Re: 1953: "The History of Unicode"

Postby rmsgrey » Sun Feb 25, 2018 12:49 am UTC

xtifr wrote:So far, the Unicode Consortium have balked at accepting any made-up languages


I expect most made-up languages are supported by Unicode (a lot are supported by ASCII) - it's the made-up graphemes that the Consortium presumably draw the line at.

User avatar
GlassHouses
Posts: 124
Joined: Thu Nov 24, 2016 12:41 pm UTC

Re: 1953: "The History of Unicode"

Postby GlassHouses » Sun Feb 25, 2018 12:52 am UTC

xtifr wrote:So far, the Unicode Consortium have balked at accepting any made-up languages

What about emoji, though?

N.B. I'm not one of those people who disapprove of emoji in Unicode. There are plenty of code points available to have some fun with... But I am curious, what's the official rationale there?

User avatar
Pfhorrest
Posts: 4806
Joined: Fri Oct 30, 2009 6:11 am UTC
Contact:

Re: 1953: "The History of Unicode"

Postby Pfhorrest » Sun Feb 25, 2018 1:42 am UTC

Emoji predated unicode and are in some older CJK encodings that unicode has to interconvert with (unicode’s primary mission), and since it then has to do that basic unicode anyway and people turned out to use it so much, they way as well get it right while they’re at it.
Forrest Cameranesi, Geek of All Trades
"I am Sam. Sam I am. I do not like trolls, flames, or spam."
The Codex Quaerendae (my philosophy) - The Chronicles of Quelouva (my fiction)


Return to “Individual XKCD Comic Threads”

Who is online

Users browsing this forum: No registered users and 30 guests