Plain Text Descriptions of Currently Available Data Subsets

Introduction

Localized Applications often use small subsets of much larger federalized data sets. While the primary reason for this might be to increase access speed, this presents an interoperability challange. The desireable choice is not to scale up the Application, but rather to transform the coverage terms, identifiers and names while maintaining the data namespace, for which indeed, "One Size Fits All".

In many respects, this is the opposite of the Semantic Web, where it is necessary to mark down plain text descriptions of data origin while insuring transparency in the logic, social and legal concepts expressed.

Many of the mnemonic HTML entities are of little use in everyday activities and should be replaced by Unicode Escapes. Like flavors of Whitespace, these characters can be replaced without loss of meaning.

Linked Data Identifiers annotated in this fashion have no depth and no discoverable semantic meaning. Grouping, however, remains and when combined with the generic definitions given in RFC 2396, the data output of a Domain Authority is well defined.


Whitespace is reduced by mapping space and new line characters to either visible, icon or invisible active forms. Some Unicodes are not recognized characters at all and must be excluded. [White Space in XML]

Plain TextReduced Text in XML

Printable characters without extended semantic meaning are excluded from transformation. [Ignored Characters in XML]

Unicode / UTF-8 Character Set Handling

For this purpose, Plain Text is typed in the usual way.

Sets

Other characters and concepts require that additional information appear in the text serialization to support transparancy. These are data attributes. Although it is far from intuitively obvious, application data is always "current" and it is necessary to differentiate between the cost of a holiday taken two years ago and the projected cost of a holiday to be taken next summer. Temperature measurements are another example to this sort of time shifting.

Examples

Keyboard Typed Alphabet & Punctuation
Upper Case LettersABCDEFGHIFKLMNOPQRSTUVWXYZ
Lower Case Lettersabcdefghijklmnopqrstuvwxyz
Digits0123456789
Ordinary Punctuation, Math & Grouping,.?/;:{}[]\|!#^*()+=_-%
Whitespace
Space (visible)[␠] (␠)
Space (invisible)[space] ( )
New Line (visible)[␤] (␤)
New Line (invisible)[new line] (
)
Escaped Punctuation, Math & Grouping
Double Quote["] (")
Ampersand[&] (&)
Less Than[<] (&#x003C;)
Greater Than[>] (&#x003E;)
Single Quote['] (&#x0027;)
Linked Characters (eGov Marks & Unit Marks)
𝔸, , @, ©, , ¤, , ®, §, , , , , , , , USD ($), USD% (¢), USD‰ (‰), Temperature (℉), Temperature (℃)

Glossary of Linked Characters and Unit Characters

Special Characters

eGov Marks
Character Escape Meaning Data Attributes Notes
𝔸 &#x1D538; Agency Organization Name Domain Authority Or Imprimatur
&#x2205; Empty Element Symbol Only
@ &#x0040; Commercial At Number, Value For Bills Of Materials Or Lists
&#x00A9; Copyright Year, Organization Name For Any Use
&#x2117; Audio Copyright Year, Organization Name Audio Recordings Only
¤ &#x00A4; Currency Symbol, Decimal Number Actual Fee Paid
&#x00B6; Paragraph Document
® &#x00AE; Registered Organization Name
§ &#x00A7; Section Document For Legal Codes
&#x2120; Service Mark Organization Name
&#x2122; Trade Mark Organization Name
&#x2105; In Care Of Name Person or Organization
&#x211E; Prescription Substance
&#x2100; To The Account Of Name Person or Organization
&#x25CA; Administrative Subdivision (Level 1) Timezone, For (Level 3), In (Level 2) For Administrative Subgroups Or Regions
&#x2666; Administrative Subdivision (Level 1) Path Timezone, In (Level 2) For Administrative Subgroups Or Regions
Unit Marks
$ &#x0024; US Dollars Decimal Number [USD] Money, Offer For Sale
$% &#x0024;&#x0025; (1/100) US Dollars Number [USD%] Money, Offer For Sale
$‰ &#0024;&#x2030; (1/1000) US Dollars Number [USD‰] Money, Offer For Sale
&#x2109; Degrees Fahrenheit (Forecast|Current), (Type=High, Low, Average, etc.), Numeric Value Temperature
&#x2103; Degrees Centigrade (Forecast|Current), (Type=High, Low, Average, etc.), Numeric Value Temperature
Character Escape Meaning Data Attributes Notes