Hacker News

Shapecatcher: Draw the Unicode character you want

by nnnnni on 10/2/2014, 9:08:15 PM with 17 comments

by Springtime on 10/3/2014, 10:05:50 AM
What would perhaps be a worthwhile feature is the ability to input already known unicode names into a text box after seeing the results, to feed the database with more useful matches.
For example, I tried drawing a somewhat joined 'TM' a couple of times but got no matches for 'Trademark symbol'. A way to manually input that Unicode name might provide the database with a positive match for the next user trying to find it.
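As a rough illustration of looking a character up by name, here is a minimal sketch using Python's standard `unicodedata` module. The helper `find_by_name` is hypothetical and has nothing to do with Shapecatcher's own code. Note that the official name is `TRADE MARK SIGN`, so a search for "Trademark symbol" as one word would not match it.

```python
import sys
import unicodedata


def find_by_name(query, limit=10):
    """Return (codepoint, char, name) for characters whose Unicode name
    contains every word of the query (case-insensitive)."""
    words = query.upper().split()
    hits = []
    for cp in range(sys.maxunicode + 1):
        # Unnamed code points (surrogates, unassigned) yield "" and are skipped.
        name = unicodedata.name(chr(cp), "")
        if name and all(w in name for w in words):
            hits.append((cp, chr(cp), name))
            if len(hits) >= limit:
                break
    return hits


# Exact lookup works when the official name is already known.
print(unicodedata.lookup("TRADE MARK SIGN"))

# Fuzzy lookup when only part of the name is known.
for cp, ch, name in find_by_name("trade mark"):
    print(f"U+{cp:04X} {ch} {name}")
```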
by drodgers on 10/3/2014, 10:17:58 AM
They didn't think to weight the prior probabilities by usage frequency* - drawing a reasonable ? gives me ȓ, ᕉ, ╔, ᣑ, Ѓ, ק, ᒌ, ŕ, ᒤ, ᒦ, ņ, ᒯ, ѓ, and finally ?.
I'm also guessing that they're directly comparing the handwritten character to some version of the Unicode character rather than to human attempts to draw the character. Human drawings are often quite different (more slanted, stylised, etc.) from typeface characters. This is much more forgivable, though, because assembling a good dataset of human-drawn characters is hard (especially for any reasonable chunk of the Unicode set).
(*this is fairly easy to do: just find some large source of typical unicode, like Wikipedia in all languages, and index them).
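A minimal sketch of the frequency-weighting idea in the comment above, assuming the recognizer returns a likelihood-like shape score per candidate and that character counts have been taken from some large corpus. The names `rerank`, `shape_scores`, and `corpus_counts` are hypothetical, the numbers are made up, and none of this reflects how Shapecatcher actually ranks results.

```python
import math
from collections import Counter


def rerank(shape_scores, corpus_counts, alpha=1.0):
    """Order candidates by shape likelihood times a usage-frequency prior.

    shape_scores: dict mapping character -> score in (0, 1]
    corpus_counts: Counter of character occurrences in a reference corpus
    alpha: add-alpha smoothing so unseen characters keep a nonzero prior
    """
    total = sum(corpus_counts.values()) + alpha * len(shape_scores)
    ranked = []
    for char, score in shape_scores.items():
        prior = (corpus_counts.get(char, 0) + alpha) / total
        # Log posterior, up to a constant: log likelihood + log prior.
        ranked.append((math.log(score) + math.log(prior), char))
    ranked.sort(reverse=True)
    return [char for _, char in ranked]


# Hypothetical data: the rare look-alikes score higher on shape alone,
# but the common character moves to the top once usage is considered.
corpus = Counter("what? why? r u sure? ŕ")
scores = {"ŕ": 0.60, "?": 0.50, "ȓ": 0.55}
print(rerank(scores, corpus))
```

With an empty corpus every candidate gets the same prior, so the ranking falls back to shape score alone.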
by Flenser on 10/3/2014, 11:44:02 AM
Some other useful unicode websites:
http://www.amp-what.com/
http://www.fileformat.info/info/unicode/char/search.htm
http://www.fileformat.info/info/unicode/block/index.htm
(I've added the above as search engines in Chrome with short mnemonics for the keyword.)
There's also:
http://unicode.johnholtripley.co.uk/ -- mobile unicode support tables
http://unicodinator.com/
http://apps.timwhitlock.info/emoji/tables/unicode -- emoji :D
http://character-code.com/
http://panmental.de/symbols/info.htm
and of course:
http://copypastecharacter.com/
Want more unicode resources? There's a list of other resources here:
http://joewlarson.com/blog/2014/01/01/useful-unicode-resourc...
by lgas on 10/3/2014, 9:17:35 AM
This is great, it's like http://detexify.kirelabs.org/classify.html but for Unicode instead of LaTeX.
by xg15 on 10/3/2014, 11:22:53 AM
An interesting side effect of this is that it shows once again why (naive implementations of) international domain names presented such a large security risk. Just draw an "A" and look at the results...
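To make the look-alike problem concrete, here is a small sketch using only Python's standard `unicodedata` module. The `scripts_mixed` helper is a hypothetical, deliberately crude heuristic (it reads the first word of each character's Unicode name). It is not how browsers or registries actually vet internationalized domain names.

```python
import unicodedata

# Three visually near-identical capitals from different scripts.
lookalikes = ["\u0041", "\u0391", "\u0410"]

for ch in lookalikes:
    print(f"U+{ord(ch):04X}  {ch}  {unicodedata.name(ch)}")


def scripts_mixed(label):
    """Crude check: take the first word of each letter's Unicode name
    (LATIN, GREEK, CYRILLIC, ...) and flag labels that use more than one."""
    kinds = {unicodedata.name(c).split()[0] for c in label if c.isalpha()}
    return len(kinds) > 1


print(scripts_mixed("paypal"))       # every letter is Latin
print(scripts_mixed("p\u0430ypal"))  # second letter is Cyrillic U+0430
```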
by ajb on 10/3/2014, 9:28:55 AM
There is also http://www.nciku.com/ for Chinese characters.
by lost_name on 10/3/2014, 5:52:43 PM
It seems (and is logical, I suppose) that you have to match the symbol pretty closely to get what you're looking for. I drew a car twice, the first time getting absolutely nothing relevant, and the second time -- trying to be more precise and using all the space available -- got automobile, taxi, bus, etc, etc.
edit: The primary problem here, I mean, is that if you don't know what the symbol looks like and want to see if it exists, you might not get hits the first time you try to draw, but it might not actually exist anyway.
by nnnnni on 10/2/2014, 9:12:35 PM
It doesn't support every unicode character yet, but it's getting there. For example, it recognized the Kannada character ttha (ಠ), but it doesn't know the poop (💩) character.
by rootbear on 10/3/2014, 4:20:14 PM
I've often thought that the best way to get access to the richness of Unicode would be a drawing pad, perhaps as part of the keyboard, or as an on-screen area, for use with a mouse. Character maps just seem clumsy to me.
For fun, I tried Eth ð, Thorn þ, and Hungarian ű, all of which it got, but not as the first choice. It did not find the Ing rune, which looks a bit like a < and > combined.
by byuu on 10/3/2014, 11:07:59 AM
Oh wow, the drawing mechanism is really satisfying.
Too bad about not supporting 漢字. The only half-decent IME pad is on Windows. Online ones (kanji.sljfaq.org) and Xorg ones (ibus-mozc) are just horrendously bad at detection. I usually have to resort to multi-radical lookups.
by teddyh on 10/3/2014, 9:27:16 AM
Nice. I wish the GNOME Character Map① could do this.
① https://wiki.gnome.org/Apps/Gucharmap
by adultSwim on 10/3/2014, 7:50:15 PM
I drew #. It returned a bunch of characters, but none of them was regular old 0x0023.
It took me several tries to get # (the regular ASCII number sign, i.e. shift+3).
by whitten on 10/3/2014, 5:45:41 PM
This is pretty cool. I expect that even if the code is open source, the real value is in the dataset used. Does anyone know the licensing information?
by danielweber on 10/3/2014, 6:46:36 PM
This is one of the first sites I learned about from HN a few years ago. Extremely useful.
by danieltillett on 10/3/2014, 10:01:40 AM
I just did an 8 and got nothing close, but wow there are a lot of interesting Unicode characters.
by imakesnowflakes on 10/3/2014, 10:09:39 AM
It works. Good work.
by benvds on 10/3/2014, 9:16:48 AM
i drew a penis and got the "male sign" :-)