Tagged: writing Toggle Comment Threads | Keyboard Shortcuts

  • Profile photo of nmw

    nmw 19:04:09 on 2016/05/27 Permalink
    Tags: , , artificial languages, , , , , , , emerge, , human intelligence, , , , , , , , , , , , , , , , , , traing set, traing sets, , , , , writing   

    Literacy and Machine Readability: Some First Attempts at a Derivation of the Primary Implications for Rational Media 

    Online, websites are accessed exclusively via machine-readable text. Specifically, the character set prescribed by ICANN, IANA, and similar regulatory organizations consists of the 26 characters of the latin alphabet, the „hyphen“ character and the 10 arabic numbers (i.e. The symbols / zyphers 0-9). Several years ago, there was a move to accommodate other language character sets (this movement is generally referred to as „Internationalized Domain Names“ [IDN]), but in reality this accommodation is nothing more than an algorithm which translates writing using such „international“ symbols into strings from the regular latin character set, and to used reserved spaces from the enormous set of strings managed by ICANN for such „international“ strings. In reality, there is no way to register a string directly using such „international“ characters. Another rarely mentioned tidbit is that this obviously means that the set of IDN strings that can be registered is vastly smaller than strings exclusively using the standardized character set approved for direct registration.

    All of that is probably much more than you wanted to know. The „long story short“ is that all domain names are machine readable (note, however, that – as far as I know – no search engine available today on the world-wide-web uses algorithms to translate IDN domain name strings into their intended „international“ character strings). All of the web works exclusively via this approved character set (even the so-called „dotted decimals“ – the numbers which refer to individual computers [the „servers“] – are named exclusively using arabic numerals, though in reality are based on groups of bits: each number represents a „byte“-sized group of 8 bits… in other words: it could be translated into a character set of 256 characters. In the past several years, there has also been a movement to extend the number of strings available to accommodate more computers from 4 bytes (commonly referred to as Ipv4 or „IP version 4“) to 6 bytes (commonly referred to as Ipv6 or „IP version 6“), thereby accommodating 256 x 256 = 65536 as many computers as before. Note, however, that each computer can accommodate many websites / domains, and the number of domain names available excedes the number of computers available by many orders of magnitude (coincidentally, the number of domain names available in each top level domain [TLD] is approximately 1 x 10^100 – in the decimal system, that’s a one with one hundred zeros, also known as 1 Googol).

    Again: Very much more than you wanted to know. 😉

    The English language has a much smaller number of words – a very large and extensive dictionary might have something like 100,000 entries. With variants such as plural forms or conjugated verb forms, that will still probably amount to far less than a million possible strings – in other words: about 94 orders of magnitude less than the number of strings available as domain names. What is more, most people you might meet on the street probably use only a couple thousand words in their daily use of „common“ language. Beyond that, the will use even fewer than that when they use the web to search for information (for example: instead of searching for „sofa“ directly, they may very well first search for something more general like „furniture“).

    What does „machine readable“ mean? It means a machine can take in data and process it algorithmicly to produce a result – you might call the result „information“. For example: There is a hope that machines will someday be able to process strings – or even groups of strings, such as this sentence – and be able to thereby derive („grok“ or „understand“) the meaning. This hope is a dream that has already existed for decades, but the successes so far have been extremely limited. As I wrote over a decade ago (in my first „Wisdom of the Language“ essay), it seems rather clear that languages change faster than machines will ever be able to understand them. Indeed, this is almost tautologically true, because machines (and so-called „artificial intelligence“) require training sets in order to learn (and such training sets from so-called „natural language“ must be expressions from the past – and not even just from the past, but also approved by speakers of the language, i.e. „literate“ people). So-called „pattern recognition“ – a crucial concept in the AI field – is always recognizing patterns which have been previously defined by humans. You cannot train a machine to do anything without a human trainer, who designs a plan (i.e., an algorithmic set of instructions) which flow from to human intelligence.

    There was a very trendy movement which was quite popular several years ago that led to the view that data might self-organize, that trends might „emerge from the data“ without needing the nuissance of consulting costly humans, and this movement eventually led to what is now commonly hyped as „big data“. All of this hype about „emergence“ is hogwash. If you don’t know what I mean when I say „hogwash“, then please look it up in a dictionary. 😉

     
  • Profile photo of nmw

    nmw 19:40:37 on 2015/11/30 Permalink
    Tags: , , , , , , phrase, phrases, , , , , , , , , , , , , writing   

    The Web Doesn’t Change Much from Day to Day 

    Lately, I have predominently been writing on some of my other websites (most recently, in particular Socio Biz and Nooblogs Com).

    Here’s something that I have observed, though, which is really a very personal insight. Some people think my work is about writing. No, it isn’t. It also isn’t about reading (or what some people like to call “listening“).

    My work is mostly about organizing — in particular: identifying those words and phrases people find most useful when they search for information (which is also known as “information retrieval”).

    Whether or not I write (or read, or listen to) something myself is not important — what you need to consider is: Whether or not you want to be found. If so, then you should contact me. 😉

     
  • Profile photo of nmw

    nmw 13:20:03 on 2015/07/23 Permalink
    Tags: author, authors, book, books, , , catalogs, , , , , , , , , , , , , , , , , , , , , , , , , , writing,   

    To Read or to Be Read 

    When I was a kid, I used to go to the library a lot… and read books. Before reading them, I would need to find them. For those of you unfamiliar with this process: This was the prototype for most search engines (back then, people studying this process went to “library schools”, graduate programs for “information science” — and the field specifically focused on what is today referred to “search”, back then it was called “information retrieval”).

    But reading is not really something “for specialists only”. Before graduating high school, regular folks also had to learn about publishing. For example: They needed to know that books have authors, that they were published by publishing companies, and so on.

    Online, titles, authors and publishing houses became domain names. There are also numbers which refer to computers — perhaps this is roughly equivalent to the way people would refer to specific shelves where specific books were stored (this system of naming shelves, which was used in the earliest libraries, would later give way to so-called “call-numbers”, a system whereby a book was given a specific sequential number where it could be found). The biggest difference between traditional libraries and the Internet is probably the fact that online, the cataloging and indexing systems are integrated into the same system as the writing that they catalog / index. Although professional abstracting and indexing services also published such volumes (which looked very much like “regular” books), and these books were usually also given call numbers, putting them on par with the more ordinary literature, the librarian was the person who made this decision… and the librarian was the person ultimately responsible for maintaining the catalog (and also for choosing what would be included in the library’s collection).

    I guess only quite novice users would assume that if something was not in the library (and/or the library’s catalogs) that it would not exist.

    Contrast that with today — where there is now an entire generation of kids who seem to believe that if something cannot be found in Google, that it doesn’t exist.

    Even though the rate of illiteracy today is quite astounding already, I now observe also that in recent years an entirely new trend is catching on. People are becoming ever less concerned with reading or writing or behaving as functionally literate persons. Instead: They are becoming more obsessed with being read… — meaning that someone (or some company) is able to trace their moves. Whereas it is becoming ever more rare for people the read or write anything resembling written texts (and/or “literature”), it is becoming ever more commonplace for people to clutch on to gadgets which track everything such quantified fetishists seem to place such a high value on. The typical quantified fetishist will feel much the same way about their gadget fetish as a democratic idealist might view the sanctity of the voting booth.

    In this milieu, there seems to also be a widespread belief that the companies collecting this data will share it publicly out of the warmness of their hearts.

     
  • Profile photo of nmw

    nmw 10:10:38 on 2015/07/16 Permalink
    Tags: , , alive, , , dead, , , , , fashion, , formless, formlessness, forms, , , , , , , , , , , writing,   

    WANTED: Is Written Language DEAD OR ALIVE? 

    We are constantly altering the forms we have inherited from previous generations, and these changes are signs of life and vitality. Indeed: The things that don’t change, the forms that rigidify, come to look to us like death — and we destroy them.

    Robert Greene (“48 Laws of Power”)

    One of the sayings I find particularly enticing is the notion that “Written language is dead”. It appeals to our experiences of printed materials, tomes that appear as tombstones of bygone ideas. Yet today, this dogma itself is no longer valid.

    Today, written language exists in the present. Written language lives and breathes according to the whims of an invisible hand that sweeps our attention from hither to thither. “Bells and whistles” give way to “ringtones”, and “ringtones” also succumb to other newfangled applications of fashion.

    More and more writing is becoming less and less etched in stone, it increasingly billows among flyers scattered by the winds of change, becomes evermore formless, ephemeral and transient. Fixed data points give way to fluid data streams.

    There is no need for remorse or backwards oriented attachment to the dead tomes of yesteryear or the innumerable generations dating back to the ancient past. We are not amoeba. We are, here and now, living in the present… — and constantly changing in order to better adapt to the future.

     
c
Compose new post
j
Next post/Next comment
k
Previous post/Previous comment
r
Reply
e
Edit
o
Show/Hide comments
t
Go to top
l
Go to login
h
Show/Hide help
shift + esc
Cancel
Skip to toolbar