Nick is slamming random keys with oversized mits on: json-ld (part 2 of 3)

Subject: json-ld (part 2 of 3)

note: originally emailed 06/19/2014 and sanitized for public consumption
sidenote: posting here so i can lead people here whenever they ask me about this stuff...
series: part 1 & part 3

This message is primarily for the TO: peeps. But some of you have asked about json-ld before and this would benefit everyone here for future reference.

Epiphany:
While reading up on some (computer) graphics articles -- I found out (in a round-about way) what ~~linked-data~~ (um, actually) the * description languages would be good for.

Not so fast:
First, why I still think it's not needed (in the context of javascript's json handler) -- it's unused information (you'll see this in a sec).

For more info on why I don't like json-ld (just like why I don't like VM's) -- it's heavy with a lot of un-needed cruff (you can read more about it from the email thread below).

Again, I'm coming from a javascript coder [for a project with restricted and security considerations] point of view...

So, what is it good for?
It (linked-data/interface description language/data format description language/etc.) is good for:

binary only data (i.e. structs/class)

getting transported between different programming languages
which might be running on different architecture (endian-ness, 32 vs 64 bits, etc.)

these (the data payload) are quite specific and can lead to catastrophic behavior if not handled properly

Examples of this type of mechanisms are:

ZeroC's Ice
Apache's Thrift
Google's Protocol Buffer
etc...

What does all that mean?
Basically, it's good for defining your data structs without (sort of) knowing anything about them.

Here's what I mean. Back to the graphics article I mentioned before. Blender (3D) is an open source modeler that had its beginnings 20 years ago.

http://code.blender.org/index.php/2013/12/how-blender-started-twenty-years-ago/

The creator of that program made his binary data files with "encoded types for the entire internal structure" -- in other words, self-describing and included in the data file. The intention was to make it backwards compatible knowing that future versions of the program will always require inevitable changes to the data file format.

http://www.blendernation.com/2008/12/01/blender-dna-rna-and-backward-compatibility/

Blender DNA, RNA and extreme backward compatibility
"This makes .blend files still readable, even when saved in 1.00"

This blew my mind.

20 years of the application -- and you can still load the data file that was made from v1.00

Not sure if a v2.71 (current version) data file would load on the v1.00 application, but you get the gist

Here are details on how this is done (just in case you're interested in how it was done)

Why strike ~~linked-data~~ and call out the * description languages
Back to "unused information" I mentioned earlier, [for our system] -- the LD stuff will NEVER be used (I can pretty much say that now with 99.9% certainty).

C/C++/java/etc. however, would HUGELY benefit from a self-describing struct/class object.

So when I started to play with:

http://json-ld.org/playground/

I said to myself, "this is interesting -- but what is this good for?"" -- I even said ([in part 1]):

this terse way of representing all of the data [is] highly error prone if written by humans -- this will need to be generated with tools just to be safe

The playground only showed results for JSON-LD and an RDF item.

But, there seems to be a number of possible representation for the same dataset:

http://rdf-translator.appspot.com/
- Raw data: ( CSML | CSV )
- RDF ( N-Triples | N3/Turtle | JSON | XML )
- OData ( Atom | JSON )
- Microdata ( JSON | HTML)
- JSON-LD

I thought to myself, there has got to be a better way to create this (remember: highly error prone if written by humans)

Then, the following showed up in [redacted] (today in fact!):

[link redacted]
They mentioned hive-json

This showed how a json ld-ish output was created from (in this case) the apache hive schema

So, out of curiosity, I thought if there were other ways to help write json-ld via converters?

Here's what I found:

http://jsonschema.net/reboot/#/

Write out your 'sample' json data
Click on the "schemarize" button it to generate the starter "schema"
Use the schema to help write the json-ld definitions

There were other resources I scanned through -- not really useful by itself, but helpful to figure out this linked-data stuff

http://www.cambridgesemantics.coam/semantic-university/sparql-by-example

Nick is slamming random keys with oversized mits on

Thursday, June 19, 2014

json-ld (part 2 of 3)

No comments:

Blog Archive

nick's shared items

s'more interesting reads

About Me