SlideShare una empresa de Scribd logo
1 de 42
Descargar para leer sin conexión
Creating Knowledge out of Interlinked 
Data 
LOD2 Webinar . 24.06.2014 . Page 1 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
http://lod2.eu 
LOD2 is a large-scale integrating project co-funded by the European 
Commission within the FP7 Information and Communication Technologies 
Work Programme. This 4-year project comprises leading Linked Open 
Data technology researchers, companies, and service providers. Coming 
from across 12 countries the partners are coordinated by the Agile 
Knowledge Engineering and Semantic Web Research Group at the 
University of Leipzig, Germany. 
LOD2 will integrate and syndicate Linked Data with existing large-scale 
applications. The project shows the benefits in the scenarios of Media and 
Publishing, Corporate Data intranets and eGovernment. 
LOD2 Webinar . 24.06.2014 . Page 2 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
http://lod2.eu 
Once per month the LOD2 webinar series offer a free webinar about 
tools and services along the Linked Open Data Life Cycle. 
Stay with us and learn more about acquisition, editing, composing, 
connected applications – and finally publishing Linked Open Data. 
LOD2 Webinar . 24.06.2014 . Page 3 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Agenda 
• Nested Data Model 
• SIREn Overview 
• Getting Started with the SIREn Elasticsearch Plugin 
• Demo 
LOD2 Webinar . 24.06.2014 . Page 4 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Schema-Less Nested Data Model 
• Model becoming prevalent: JSON, XML, Avro, … 
– Can be arbitrarily nested and large 
– No strict schema / structure enforced 
• Schema-less brings 
– Flexibility 
– Ease of development 
• Developers do not have to invest significant modelling 
effort upfront 
LOD2 Webinar . 24.06.2014 . Page 5 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Introducing SIREn 
• Lucene, Solr and Elasticsearch plugin for indexing and 
searching JSON 
• Rich data model (JSON) 
– Nested objects, nested arrays, datatypes 
– Generic architecture compatible with various nested data models: JSON, 
JSON-LD, XML, Avro, ... 
• Schema-agnostic 
– SIREn does not require any schema definition to index and search data 
– Schema definition can change across records 
• Designed from the ground up for high performance and 
scalability 
LOD2 Webinar . 24.06.2014 . Page 6 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Introducing SIREn 
LOD2 Webinar . 24.06.2014 . Page 7 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Elasticsearch - Overview 
• Document-oriented search and analytics engine 
– JSON Document 
– Based on Apache Lucene 
• Distributed, Replication 
– High Performance and Availability 
• REST API 
LOD2 Webinar . 24.06.2014 . Page 8 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Elasticsearch – Basic Concepts 
• Index = Collection of Documents 
– Can have multiple shards and replicas 
• Type = A set of documents sharing the same schema 
– Similar to a DB table 
• Document = JSON object 
– Uniquely identified (index/type/id) 
– Similar to a DB record 
LOD2 Webinar . 24.06.2014 . Page 9 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
SIREn – Basic Concepts 
• JSON object = Tree 
– Different mapping available 
• Node = An element of the tree 
– Can have a parent and one or more children 
– Contains data: text, numeric, boolean 
LOD2 Webinar . 24.06.2014 . Page 10 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
SIREn – Basic Concepts 
JSON Object Tree 
{ 
{ 
"name" : "Elasticsearch", 
"category_code" : "search", 
"funding_rounds" : [ 
"name" : "Elasticsearch", 
"category_code" : "search", 
"funding_rounds" : [ 
{ 
{ 
"round_code" : "a", 
"raised_amount" : 10000000, 
"funded_year" : 2012, 
"investments" : [ 
"round_code" : "a", 
"raised_amount" : 10000000, 
"funded_year" : 2012, 
"investments" : [ 
{ 
{ 
"name" : "Data Collective", 
"type" : "financial-org" 
"name" : "Data Collective", 
"type" : "financial-org" 
}, 
… 
}, 
… 
] 
] 
}, 
… 
}, 
… 
] 
] 
} 
} 
name : Elasticsearch 
category_code : search 
funding_rounds 
round_code : a 
raised_amount : 10000000 
funded_year : 2012 
investments 
name : Data Collective 
type : financial-org 
investments 
name : ... 
LOD2 Webinar . 24.06.2014 . Page 11 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Getting Started with Elasticsearch & SIREn 
http://sirendb.com/downloads/ 
(Elasticsearch Distribution Coming Soon) 
LOD2 Webinar . 24.06.2014 . Page 12 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
SIREn Elasticsearch Distribution 
├── dist 
│ ├── siren-core-1.3.jar 
│ └── siren-qparser-1.3.jar 
├── docs 
│ ├── apidoc 
│ └── siren-1.3-javadoc.jar 
├── example 
├── CHANGES.txt 
├── LICENSE.md 
├── README.md 
└── THIRD-PARTY.txt 
├── dist 
│ ├── siren-core-1.3.jar 
│ └── siren-qparser-1.3.jar 
├── docs 
│ ├── apidoc 
│ └── siren-1.3-javadoc.jar 
├── example 
├── CHANGES.txt 
├── LICENSE.md 
├── README.md 
└── THIRD-PARTY.txt 
SIREn 
Libraries 
SIREn 
Libraries 
SIREn 
Javadoc 
SIREn 
Javadoc 
Elasticsearch 
Distribution 
Elasticsearch 
Distribution 
LOD2 Webinar . 24.06.2014 . Page 13 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
SIREn Elasticsearch 
Distribution 
├── bin 
│ ├── elasticsearch 
│ └── ... 
├── config 
│ ├── elasticsearch.yml 
│ └── logging.yml 
├── lib 
│ ├── elasticsearch-1.2.0.jar 
│ └── ... 
├── plugins 
│ └── siren-plugin 
│ ├── siren-elasticsearch-1.3.jar 
│ └── ... 
├── datasets 
├── NOTICE.txt 
└── README.md 
├── bin 
│ ├── elasticsearch 
│ └── ... 
├── config 
│ ├── elasticsearch.yml 
│ └── logging.yml 
├── lib 
│ ├── elasticsearch-1.2.0.jar 
│ └── ... 
├── plugins 
│ └── siren-plugin 
│ ├── siren-elasticsearch-1.3.jar 
│ └── ... 
├── datasets 
├── NOTICE.txt 
└── README.md 
EExxeeccuutatabbleless 
CCoonnfifgiguuraratitoionn 
SSIRIREEnn P Plulugginin 
LOD2 Webinar . 24.06.2014 . Page 14 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Starting Elasticsearch 
$ ./bin/elasticsearch 
$ ./bin/elasticsearch 
[INFO ][node ] [Valerie Cooper] version[1.2.0], pid[12842], … 
[INFO ][node ] [Valerie Cooper] initializing ... 
[INFO ][plugins] [Valerie Cooper] loaded [siren-plugin], sites [] 
[INFO ][node ] [Valerie Cooper] initialized 
[INFO ][node ] [Valerie Cooper] starting ... 
… 
[INFO ][node ] [Valerie Cooper] started 
[INFO ][node ] [Valerie Cooper] version[1.2.0], pid[12842], … 
[INFO ][node ] [Valerie Cooper] initializing ... 
[INFO ][plugins] [Valerie Cooper] loaded [siren-plugin], sites [] 
[INFO ][node ] [Valerie Cooper] initialized 
[INFO ][node ] [Valerie Cooper] starting ... 
… 
[INFO ][node ] [Valerie Cooper] started 
LOD2 Webinar . 24.06.2014 . Page 15 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Create an Index and Map 
SIREn’s Field 
$ curl -XPUT 'http://localhost:9200/test/' –d ' 
{ 
$ curl -XPUT 'http://localhost:9200/test/' –d ' 
{ 
"mappings" : { 
"mappings" : { 
"companies" : { 
"companies" : { 
"properties" : { 
"properties" : { 
SIREn’s Field 
"_siren_source" : { 
"index" : "analyzed", 
"analyzer" : "hybrid-json", 
"postings_format" : "Siren10AFor", 
"store" : "no", 
"type" : "string" 
} 
}, 
"_siren" : {} 
"_siren_source" : { 
"index" : "analyzed", 
"analyzer" : "hybrid-json", 
"postings_format" : "Siren10AFor", 
"store" : "no", 
"type" : "string" 
} 
}, 
"_siren" : {} 
} 
} 
} 
} 
}' 
}' 
SIREn’s Analyzer 
SIREn’s 
Posting Format 
LOD2 Webinar . 24.06.2014 . Page 16 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Indexing a Document 
$ curl -XPUT 'http://localhost:9200/test/companies/1' –d ' 
{ 
$ curl -XPUT 'http://localhost:9200/test/companies/1' –d ' 
{ 
"name" : "Elasticsearch", 
"category_code" : "search", 
"funding_rounds" : [ 
"name" : "Elasticsearch", 
"category_code" : "search", 
"funding_rounds" : [ 
{ 
{ 
"round_code" : "a", 
"raised_amount" : 10000000, 
"funded_year" : 2012, 
"investments" : [ 
"round_code" : "a", 
"raised_amount" : 10000000, 
"funded_year" : 2012, 
"investments" : [ 
{ 
{ 
"name" : "Data Collective", 
"type" : "financial-org" 
"name" : "Data Collective", 
"type" : "financial-org" 
} 
} 
] 
] 
} 
} 
] 
] 
}' 
}' 
LOD2 Webinar . 24.06.2014 . Page 17 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Customising Datatype Analysis 
• SIREn distinguishes 5 types of values in the JSON model: 
– Field, String, Long, Double, Boolean 
{ 
{ 
"name" : "Elasticsearch", 
"category_code" : "search", 
"funding_rounds" : [ 
"name" : "Elasticsearch", 
"category_code" : "search", 
"funding_rounds" : [ 
{ 
{ 
"round_code" : "a", 
"raised_amount" : 10000000, 
"funded_year" : 2012, 
"investments" : [ 
"round_code" : "a", 
"raised_amount" : 10000000, 
"funded_year" : 2012, 
"investments" : [ 
{ 
{ 
json:field 
"name" : "Data Collective", 
"type" : "financial-org" 
"name" : "Data Collective", 
"type" : "financial-org" 
} 
} 
] 
] 
} 
} 
] 
] 
} 
} 
xsd:long 
xsd:string 
LOD2 Webinar . 24.06.2014 . Page 18 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Customising Datatype Analysis 
# config/elasticsearch.yml 
# config/elasticsearch.yml 
siren.analysis.datatype: 
siren.analysis.datatype: 
http://json.org/field: 
http://json.org/field: 
index_analyzer: keyword 
index_analyzer: keyword 
http://www.w3.org/2001/XMLSchema#string: 
http://www.w3.org/2001/XMLSchema#string: 
index_analyzer: simple 
search_analyzer: simpleFolding 
index_analyzer: simple 
search_analyzer: simpleFolding 
http://www.w3.org/2001/XMLSchema#long: 
http://www.w3.org/2001/XMLSchema#long: 
index_analyzer: long 
index_analyzer: long 
index.analysis.analyzer: 
index.analysis.analyzer: 
simpleFolding: 
simpleFolding: 
type: custom 
tokenizer: letter 
filter: [lowercase, asciifolding] 
type: custom 
tokenizer: letter 
filter: [lowercase, asciifolding] 
LOD2 Webinar . 24.06.2014 . Page 19 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Node Query 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
"hybrid-json" : { 
"hybrid-json" : { 
"node" : { 
"node" : { 
"query" : "search", 
"query" : "search", 
} 
} 
} 
} 
}' 
}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
round_code : a 
raised_amount : 10000000 
funded_year : 2012 
investments 
name : Data Collective 
type : financial-org 
investments 
name : ... 
SIREn’s Query Type 
SIREn’s Query 
Full-Text Query: 
•Boolean, Phrase, Range, Fuzzy, 
Regexp, Proximity, ... 
LOD2 Webinar . 24.06.2014 . Page 20 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Node Query 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
"hybrid-json" : { 
"hybrid-json" : { 
"node" : { 
"node" : { 
"query" : "search", 
"query" : "search", 
} 
} 
} 
} 
}' 
}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
round_code : a 
raised_amount : 10000000 
funded_year : 2012 
investments 
name : Data Collective 
type : financial-org 
investments 
name : ... 
LOD2 Webinar . 24.06.2014 . Page 21 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Node Query 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
"hybrid-json" : { 
"hybrid-json" : { 
"node" : { 
"node" : { 
"query" : "search", 
"attribute" : "category_code", 
"query" : "search", 
"attribute" : "category_code", 
} 
} 
} 
} 
}' 
}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
round_code : a 
raised_amount : 10000000 
funded_year : 2012 
investments 
name : Data Collective 
type : financial-org 
investments 
name : ... 
LOD2 Webinar . 24.06.2014 . Page 22 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Node Query 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
"hybrid-json" : { 
"hybrid-json" : { 
"node" : { 
"node" : { 
"query" : "search", 
"attribute" : "category_code", 
"query" : "search", 
"attribute" : "category_code", 
} 
} 
} 
} 
}' 
}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
round_code : a 
raised_amount : 10000000 
funded_year : 2012 
investments 
name : Data Collective 
type : financial-org 
investments 
name : ... 
LOD2 Webinar . 24.06.2014 . Page 23 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Twig Query 
• Query operators for Ancestor-Descendant and Parent- 
Child relationships 
LOD2 Webinar . 24.06.2014 . Page 24 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Twig Query 
• Query operators for Ancestor-Descendant and Parent- 
Child relationships 
• Consists of a root query 
Boolean 
LOD2 Webinar . 24.06.2014 . Page 25 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Twig Query 
• Query operators for Ancestor-Descendant and Parent- 
Child relationships 
• Consists of a root query and one or more child 
Boolean 
Phrase 
MUST 
LOD2 Webinar . 24.06.2014 . Page 26 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Twig Query 
• Query operators for Ancestor-Descendant and Parent- 
Child relationships 
• Consists of a root query and one or more child and 
descendant queries 
Boolean 
Phrase 
MUST 
Boolean 
SHOULD 
LOD2 Webinar . 24.06.2014 . Page 27 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Twig Query 
• Query operators for Ancestor-Descendant and Parent- 
Child relationships 
• Consists of a root query and one or more child and 
descendant queries 
• Can be nested to form complex tree structure 
Boolean 
Phrase 
MUST 
Twig 
NOT 
Range 
MUST 
Boolean 
SHOULD 
LOD2 Webinar . 24.06.2014 . Page 28 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Twig Query 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
"hybrid-json" : { 
"hybrid-json" : { 
"twig" : { 
"twig" : { 
Twig Query 
Operator Root Query 
"root" : "funding_rounds" 
"child" : [{ 
"root" : "funding_rounds" 
"child" : [{ 
"occur" : "must", 
"node" : { 
"occur" : "must", 
"node" : { 
"query" : "2012", 
"attribute" : "funded_year" 
"query" : "2012", 
"attribute" : "funded_year" 
} 
} 
}] 
}] 
} 
} 
} 
} 
}' 
}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
round_code : a 
raised_amount : 10000000 
funded_year : 2012 
investments 
name : Data Collective 
type : financial-org 
investments 
name : ... 
Child Query 
LOD2 Webinar . 24.06.2014 . Page 29 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Twig Query 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
"hybrid-json" : { 
"hybrid-json" : { 
"twig" : { 
"twig" : { 
"root" : "funding_rounds" 
"child" : [{ 
"root" : "funding_rounds" 
"child" : [{ 
"occur" : "must", 
"node" : { 
"occur" : "must", 
"node" : { 
"query" : "2012", 
"attribute" : "funded_year" 
"query" : "2012", 
"attribute" : "funded_year" 
} 
} 
}] 
}] 
} 
} 
} 
} 
}' 
}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
round_code : a 
raised_amount : 10000000 
funded_year : 2012 
investments 
name : Data Collective 
type : financial-org 
investments 
name : ... 
LOD2 Webinar . 24.06.2014 . Page 30 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Twig Query 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
"hybrid-json" : { 
"hybrid-json" : { 
"twig" : { 
"twig" : { 
"root" : "funding_rounds" 
"child" : [{ 
"root" : "funding_rounds" 
"child" : [{ 
"occur" : "must", 
"node" : { 
"occur" : "must", 
"node" : { 
"query" : "2012", 
"attribute" : "funded_year" 
"query" : "2012", 
"attribute" : "funded_year" 
} 
} 
}] 
}] 
} 
} 
} 
} 
}' 
}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
round_code : a 
raised_amount : 10000000 
funded_year : 2012 
investments 
name : Data Collective 
type : financial-org 
investments 
name : ... 
LOD2 Webinar . 24.06.2014 . Page 31 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Twig Query 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
"hybrid-json" : { 
"hybrid-json" : { 
"twig" : { 
"twig" : { 
"root" : "funding_rounds" 
"child" : [{ 
"root" : "funding_rounds" 
"child" : [{ 
"occur" : "must", 
"node" : { 
"occur" : "must", 
"node" : { 
"query" : "2012", 
"attribute" : "funded_year" 
"query" : "2012", 
"attribute" : "funded_year" 
} 
} 
}] 
}] 
} 
} 
} 
} 
}' 
}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
round_code : a 
raised_amount : 10000000 
funded_year : 2012 
investments 
name : Data Collective 
type : financial-org 
investments 
name : ... 
LOD2 Webinar . 24.06.2014 . Page 32 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Twig Query 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
$ curl -XGET 'localhost:9200/test/companies/_search' –d ' 
{ 
"query" : { 
"hybrid-json" : { 
"hybrid-json" : { 
"twig" : { 
"twig" : { 
"root" : "funding_rounds" 
"child" : [{ 
"root" : "funding_rounds" 
"child" : [{ 
"occur" : "must", 
"node" : { 
"occur" : "must", 
"node" : { 
"query" : "2012", 
"attribute" : "funded_year" 
"query" : "2012", 
"attribute" : "funded_year" 
} 
} 
}] 
}] 
} 
} 
} 
} 
}' 
}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
round_code : a 
raised_amount : 10000000 
funded_year : 2012 
investments 
name : Data Collective 
type : financial-org 
investments 
name : ... 
LOD2 Webinar . 24.06.2014 . Page 33 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Proximity 
Query 
$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ 
"query" : { "hybrid-json" : { 
$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ 
"query" : { "hybrid-json" : { 
"boolean" : { 
"boolean" : { 
"inOrder" : true, 
"slop" : 0, 
"clause" : [{ 
"inOrder" : true, 
"slop" : 0, 
"clause" : [{ 
"twig" : { 
"twig" : { 
"root" : "funding_rounds", 
"descendant" : [{ 
Proximity Constraint 
"root" : "funding_rounds", 
"descendant" : [{ 
Order Constraint 
"level" : 2, 
"node" : { "query" : "Data AND Collective" } 
"level" : 2, 
"node" : { "query" : "Data AND Collective" } 
}] 
}] 
} 
} 
},{ 
},{ 
"twig" : { 
"twig" : { 
"root" : "funding_rounds", 
"descendant" : [{ 
"root" : "funding_rounds", 
"descendant" : [{ 
"level" : 2, 
"node" : { "query" : "Index AND Ventures" } 
"level" : 2, 
"node" : { "query" : "Index AND Ventures" } 
}] 
}] 
} 
} 
}] 
}] 
}}}}' 
}}}}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
investments 
name : Data Collective 
type : financial-org 
funding_rounds 
investments 
name : Index Ventures 
type : financial-org 
11 
22 
33 
44 
Boolean Clauses 
LOD2 Webinar . 24.06.2014 . Page 34 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Proximity 
Query 
$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ 
"query" : { "hybrid-json" : { 
$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ 
"query" : { "hybrid-json" : { 
"boolean" : { 
"boolean" : { 
"inOrder" : true, 
"slop" : 0, 
"clause" : [{ 
"inOrder" : true, 
"slop" : 0, 
"clause" : [{ 
"twig" : { 
"twig" : { 
"root" : "funding_rounds", 
"descendant" : [{ 
"root" : "funding_rounds", 
"descendant" : [{ 
"level" : 2, 
"node" : { "query" : "Data AND Collective" } 
"level" : 2, 
"node" : { "query" : "Data AND Collective" } 
}] 
}] 
} 
} 
},{ 
},{ 
"twig" : { 
"twig" : { 
"root" : "funding_rounds", 
"descendant" : [{ 
"root" : "funding_rounds", 
"descendant" : [{ 
"level" : 2, 
"node" : { "query" : "Index AND Ventures" } 
"level" : 2, 
"node" : { "query" : "Index AND Ventures" } 
}] 
}] 
} 
} 
}] 
}] 
}}}}' 
}}}}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
investments 
name : Data Collective 
type : financial-org 
funding_rounds 
investments 
name : Index Ventures 
type : financial-org 
11 
22 
33 
44 
LOD2 Webinar . 24.06.2014 . Page 35 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Proximity 
Query 
$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ 
"query" : { "hybrid-json" : { 
$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ 
"query" : { "hybrid-json" : { 
"boolean" : { 
"boolean" : { 
"inOrder" : true, 
"slop" : 0, 
"clause" : [{ 
"inOrder" : true, 
"slop" : 0, 
"clause" : [{ 
"twig" : { 
"twig" : { 
"root" : "funding_rounds", 
"descendant" : [{ 
"root" : "funding_rounds", 
"descendant" : [{ 
"level" : 2, 
"node" : { "query" : "Data AND Collective" } 
"level" : 2, 
"node" : { "query" : "Data AND Collective" } 
}] 
}] 
} 
} 
},{ 
},{ 
"twig" : { 
"twig" : { 
"root" : "funding_rounds", 
"descendant" : [{ 
"root" : "funding_rounds", 
"descendant" : [{ 
"level" : 2, 
"node" : { "query" : "Index AND Ventures" } 
"level" : 2, 
"node" : { "query" : "Index AND Ventures" } 
}] 
}] 
} 
} 
}] 
}] 
}}}}' 
}}}}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
investments 
name : Data Collective 
type : financial-org 
funding_rounds 
investments 
name : Index Ventures 
type : financial-org 
11 
22 
33 
44 
LOD2 Webinar . 24.06.2014 . Page 36 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Searching: Proximity 
Query 
$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ 
"query" : { "hybrid-json" : { 
$ curl -XGET 'localhost:9200/test/companies/_search' –d '{ 
"query" : { "hybrid-json" : { 
"boolean" : { 
"boolean" : { 
"inOrder" : true, 
"slop" : 0, 
"clause" : [{ 
"inOrder" : true, 
"slop" : 0, 
"clause" : [{ 
"twig" : { 
"twig" : { 
"root" : "funding_rounds", 
"descendant" : [{ 
"root" : "funding_rounds", 
"descendant" : [{ 
"level" : 2, 
"node" : { "query" : "Data AND Collective" } 
"level" : 2, 
"node" : { "query" : "Data AND Collective" } 
}] 
}] 
} 
} 
},{ 
},{ 
"twig" : { 
"twig" : { 
"root" : "funding_rounds", 
"descendant" : [{ 
"root" : "funding_rounds", 
"descendant" : [{ 
"level" : 2, 
"node" : { "query" : "Index AND Ventures" } 
"level" : 2, 
"node" : { "query" : "Index AND Ventures" } 
}] 
}] 
} 
} 
}] 
}] 
}}}}' 
}}}}' 
name : Elasticsearch 
category_code : search 
funding_rounds 
investments 
name : Data Collective 
type : financial-org 
funding_rounds 
investments 
name : Index Ventures 
type : financial-org 
11 
22 
33 
44 
LOD2 Webinar . 24.06.2014 . Page 37 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Demo 
LOD2 Webinar . 24.06.2014 . Page 38 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Conclusion 
• SIREn’s Key Feature 
– Dynamic/Schema-less Data Management 
– Nested Data 
– High performance and scalability 
– Powerful search operators 
– Elasticsearch/Solr integration 
• Contact 
– http://sirendb.com 
– SindiceTech 
– info@sindicetech.com 
LOD2 Webinar . 24.06.2014 . Page 39 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
Credits 
Jingle R.E.M., Martin Kaltenböck, Florian Kondert 
Coordination Thomas Thurner 
Martin Kaltenböck 
Moderation Martin Kaltenböck 
Presented by Renaud Delbru, Harish Kumar 
LOD2 Webinar . 24.06.2014 . Page 40 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
http://lod2.eu 
Hope you enjoyed staying with us – if you need more detailed 
information, visit us at www.lod2.eu and let us know how we can 
improve to meet your expectations! 
Don’t forget to register for our next webinar 
20.12. 2011 - Virtuoso (Open Link Software) 
24.01. 2012 - OntoWiki (University of Leipzig, Germany) 
Have a great day and don’t forget ... 
LOD2 Webinar . 24.06.2014 . Page 41 http://lod2.eu
Creating Knowledge out of Interlinked 
Data 
http://lod2.eu 
LOD2 Webinar . 24.06.2014 . Page 42 http://lod2.eu

Más contenido relacionado

La actualidad más candente

Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataOpen City Foundation
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research datavty
 
Standardizing for Open Data
Standardizing for Open DataStandardizing for Open Data
Standardizing for Open DataIvan Herman
 
Industry Ontologies: Case Studies in Creating and Extending Schema.org
Industry Ontologies: Case Studies in Creating and Extending Schema.org Industry Ontologies: Case Studies in Creating and Extending Schema.org
Industry Ontologies: Case Studies in Creating and Extending Schema.org sopekmir
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
 
Semantic Web Landscape 2009
Semantic Web Landscape 2009Semantic Web Landscape 2009
Semantic Web Landscape 2009LeeFeigenbaum
 
Technical integration of data repositories status and challenges
Technical integration of data repositories status and challengesTechnical integration of data repositories status and challenges
Technical integration of data repositories status and challengesvty
 
The world of Docker and Kubernetes
The world of Docker and Kubernetes The world of Docker and Kubernetes
The world of Docker and Kubernetes vty
 
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in DataverseClariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataversevty
 
External controlled vocabularies support in Dataverse
External controlled vocabularies support in DataverseExternal controlled vocabularies support in Dataverse
External controlled vocabularies support in Dataversevty
 
CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse vty
 
GraphChain
GraphChainGraphChain
GraphChainsopekmir
 

La actualidad más candente (20)

LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine LOD2 Webinar Series: Zemanta / Open refine
LOD2 Webinar Series: Zemanta / Open refine
 
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked DataSoren Auer - LOD2 - creating knowledge out of Interlinked Data
Soren Auer - LOD2 - creating knowledge out of Interlinked Data
 
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and AuthoringLOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research data
 
Lod2
Lod2Lod2
Lod2
 
LOD2 webinar series: Virtuoso by OpenLink Software
LOD2 webinar series: Virtuoso by OpenLink SoftwareLOD2 webinar series: Virtuoso by OpenLink Software
LOD2 webinar series: Virtuoso by OpenLink Software
 
Standardizing for Open Data
Standardizing for Open DataStandardizing for Open Data
Standardizing for Open Data
 
Industry Ontologies: Case Studies in Creating and Extending Schema.org
Industry Ontologies: Case Studies in Creating and Extending Schema.org Industry Ontologies: Case Studies in Creating and Extending Schema.org
Industry Ontologies: Case Studies in Creating and Extending Schema.org
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
 
Semantic Web Landscape 2009
Semantic Web Landscape 2009Semantic Web Landscape 2009
Semantic Web Landscape 2009
 
LOD2: State of Play WP3A - Knowledge Base Creation, Enrichment and Repair
LOD2: State of Play WP3A - Knowledge Base Creation, Enrichment and RepairLOD2: State of Play WP3A - Knowledge Base Creation, Enrichment and Repair
LOD2: State of Play WP3A - Knowledge Base Creation, Enrichment and Repair
 
Free Webinar: LOD2 Stack - 1st release
Free Webinar: LOD2 Stack - 1st releaseFree Webinar: LOD2 Stack - 1st release
Free Webinar: LOD2 Stack - 1st release
 
Technical integration of data repositories status and challenges
Technical integration of data repositories status and challengesTechnical integration of data repositories status and challenges
Technical integration of data repositories status and challenges
 
The world of Docker and Kubernetes
The world of Docker and Kubernetes The world of Docker and Kubernetes
The world of Docker and Kubernetes
 
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in DataverseClariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
 
External controlled vocabularies support in Dataverse
External controlled vocabularies support in DataverseExternal controlled vocabularies support in Dataverse
External controlled vocabularies support in Dataverse
 
CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse
 
KEDL DBpedia 2019
KEDL DBpedia  2019KEDL DBpedia  2019
KEDL DBpedia 2019
 
GraphChain
GraphChainGraphChain
GraphChain
 
LOD2: State of Play WP1: Requirements, Design & LOD2 Stack Prototype
LOD2: State of Play WP1: Requirements, Design & LOD2 Stack PrototypeLOD2: State of Play WP1: Requirements, Design & LOD2 Stack Prototype
LOD2: State of Play WP1: Requirements, Design & LOD2 Stack Prototype
 

Similar a LOD2 Webinar: SIREn

High Performance JSON Search and Relational Faceted Browsing with Lucene
High Performance JSON Search and Relational Faceted Browsing with LuceneHigh Performance JSON Search and Relational Faceted Browsing with Lucene
High Performance JSON Search and Relational Faceted Browsing with Lucenelucenerevolution
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data VisualizationLaura Po
 
(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGG(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGGRatko Mutavdzic
 
Accessing the Linked Open Data Cloud via ODBC
Accessing the Linked Open Data Cloud via ODBCAccessing the Linked Open Data Cloud via ODBC
Accessing the Linked Open Data Cloud via ODBCKingsley Uyi Idehen
 
(PROJEKTURA) open data big data @tgg osijek
(PROJEKTURA) open data big data @tgg osijek(PROJEKTURA) open data big data @tgg osijek
(PROJEKTURA) open data big data @tgg osijekRatko Mutavdzic
 
OpenAIRE and the Case of Irish Repositories
OpenAIRE and the Case of Irish RepositoriesOpenAIRE and the Case of Irish Repositories
OpenAIRE and the Case of Irish RepositoriesRIANIreland
 
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...OpenAIRE
 
GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?Neo4j
 
Exploring the Semantic Web
Exploring the Semantic WebExploring the Semantic Web
Exploring the Semantic WebRoberto García
 
The LOD Gateway: Open Source Infrastructure for Linked Data
The LOD Gateway: Open Source Infrastructure for Linked DataThe LOD Gateway: Open Source Infrastructure for Linked Data
The LOD Gateway: Open Source Infrastructure for Linked DataDavid Newbury
 
Facilitating Data Curation: a Solution Developed in the Toxicology Domain
Facilitating Data Curation: a Solution Developed in the Toxicology DomainFacilitating Data Curation: a Solution Developed in the Toxicology Domain
Facilitating Data Curation: a Solution Developed in the Toxicology DomainChristophe Debruyne
 
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...Marta Villegas
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph IntroductionSören Auer
 
Linked Data: opportunities and challenges
Linked Data: opportunities and challengesLinked Data: opportunities and challenges
Linked Data: opportunities and challengesMichael Hausenblas
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesTony Hammond
 
Rank | Analyse | Lead | Search
Rank | Analyse | Lead | SearchRank | Analyse | Lead | Search
Rank | Analyse | Lead | Searchsopekmir
 
Producing, publishing and consuming linked data - CSHALS 2013
Producing, publishing and consuming linked data - CSHALS 2013Producing, publishing and consuming linked data - CSHALS 2013
Producing, publishing and consuming linked data - CSHALS 2013François Belleau
 
Searching Relational Data with Elasticsearch
Searching Relational Data with ElasticsearchSearching Relational Data with Elasticsearch
Searching Relational Data with Elasticsearchsirensolutions
 

Similar a LOD2 Webinar: SIREn (20)

High Performance JSON Search and Relational Faceted Browsing with Lucene
High Performance JSON Search and Relational Faceted Browsing with LuceneHigh Performance JSON Search and Relational Faceted Browsing with Lucene
High Performance JSON Search and Relational Faceted Browsing with Lucene
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data Visualization
 
(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGG(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGG
 
Accessing the Linked Open Data Cloud via ODBC
Accessing the Linked Open Data Cloud via ODBCAccessing the Linked Open Data Cloud via ODBC
Accessing the Linked Open Data Cloud via ODBC
 
(PROJEKTURA) open data big data @tgg osijek
(PROJEKTURA) open data big data @tgg osijek(PROJEKTURA) open data big data @tgg osijek
(PROJEKTURA) open data big data @tgg osijek
 
OpenAIRE and the Case of Irish Repositories
OpenAIRE and the Case of Irish RepositoriesOpenAIRE and the Case of Irish Repositories
OpenAIRE and the Case of Irish Repositories
 
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
OpenAIRE and the case of Irish Repositories, by Jochen Schirrwagen (RIAN Work...
 
GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?GraphTour 2020 - Neo4j: What's New?
GraphTour 2020 - Neo4j: What's New?
 
LOD2 Webinar Series: Virtuoso 7
LOD2 Webinar Series: Virtuoso 7LOD2 Webinar Series: Virtuoso 7
LOD2 Webinar Series: Virtuoso 7
 
Exploring the Semantic Web
Exploring the Semantic WebExploring the Semantic Web
Exploring the Semantic Web
 
The LOD Gateway: Open Source Infrastructure for Linked Data
The LOD Gateway: Open Source Infrastructure for Linked DataThe LOD Gateway: Open Source Infrastructure for Linked Data
The LOD Gateway: Open Source Infrastructure for Linked Data
 
Facilitating Data Curation: a Solution Developed in the Toxicology Domain
Facilitating Data Curation: a Solution Developed in the Toxicology DomainFacilitating Data Curation: a Solution Developed in the Toxicology Domain
Facilitating Data Curation: a Solution Developed in the Toxicology Domain
 
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph Introduction
 
Linked Data: opportunities and challenges
Linked Data: opportunities and challengesLinked Data: opportunities and challenges
Linked Data: opportunities and challenges
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologies
 
Rank | Analyse | Lead | Search
Rank | Analyse | Lead | SearchRank | Analyse | Lead | Search
Rank | Analyse | Lead | Search
 
Where is the World is my Open Government Data?
Where is the World is my Open Government Data?Where is the World is my Open Government Data?
Where is the World is my Open Government Data?
 
Producing, publishing and consuming linked data - CSHALS 2013
Producing, publishing and consuming linked data - CSHALS 2013Producing, publishing and consuming linked data - CSHALS 2013
Producing, publishing and consuming linked data - CSHALS 2013
 
Searching Relational Data with Elasticsearch
Searching Relational Data with ElasticsearchSearching Relational Data with Elasticsearch
Searching Relational Data with Elasticsearch
 

Más de LOD2 Creating Knowledge out of Interlinked Data

Más de LOD2 Creating Knowledge out of Interlinked Data (19)

LOD2 Webinar Series: DBpedia Spotlight
LOD2 Webinar Series: DBpedia SpotlightLOD2 Webinar Series: DBpedia Spotlight
LOD2 Webinar Series: DBpedia Spotlight
 
LOD2 Webinar Series: publicdata.eu and CKAN
LOD2 Webinar Series: publicdata.eu and CKANLOD2 Webinar Series: publicdata.eu and CKAN
LOD2 Webinar Series: publicdata.eu and CKAN
 
LOD2 Webinar Series: LOD2 in information and publishing industry
LOD2 Webinar Series: LOD2 in information and publishing industryLOD2 Webinar Series: LOD2 in information and publishing industry
LOD2 Webinar Series: LOD2 in information and publishing industry
 
LOD2 General Presentation 2012
LOD2 General Presentation 2012LOD2 General Presentation 2012
LOD2 General Presentation 2012
 
LOD2 Webinar Series: PoolParty
LOD2 Webinar Series: PoolPartyLOD2 Webinar Series: PoolParty
LOD2 Webinar Series: PoolParty
 
LOD2 Webinar Series: LIMES
LOD2 Webinar Series: LIMESLOD2 Webinar Series: LIMES
LOD2 Webinar Series: LIMES
 
LOD2 Plenary Vienna 2012: WP12 - Project Management
LOD2 Plenary Vienna 2012: WP12 - Project ManagementLOD2 Plenary Vienna 2012: WP12 - Project Management
LOD2 Plenary Vienna 2012: WP12 - Project Management
 
LOD2 Plenary Vienna 2012: WP10 - Training, Dissemination, Community Building,...
LOD2 Plenary Vienna 2012: WP10 - Training, Dissemination, Community Building,...LOD2 Plenary Vienna 2012: WP10 - Training, Dissemination, Community Building,...
LOD2 Plenary Vienna 2012: WP10 - Training, Dissemination, Community Building,...
 
LOD2 Plenary Vienna 2012: WP9A - LOD for a Distributed Marketplace for Public...
LOD2 Plenary Vienna 2012: WP9A - LOD for a Distributed Marketplace for Public...LOD2 Plenary Vienna 2012: WP9A - LOD for a Distributed Marketplace for Public...
LOD2 Plenary Vienna 2012: WP9A - LOD for a Distributed Marketplace for Public...
 
LOD2 Plenary Vienna 2012: WP9 publicdata.eu – Publishing Governmental Informa...
LOD2 Plenary Vienna 2012: WP9 publicdata.eu – Publishing Governmental Informa...LOD2 Plenary Vienna 2012: WP9 publicdata.eu – Publishing Governmental Informa...
LOD2 Plenary Vienna 2012: WP9 publicdata.eu – Publishing Governmental Informa...
 
LOD2 Plenary Vienna 2012: WP8: Linked Open Data for Enterprise Data Web
LOD2 Plenary Vienna 2012: WP8: Linked Open Data for Enterprise Data WebLOD2 Plenary Vienna 2012: WP8: Linked Open Data for Enterprise Data Web
LOD2 Plenary Vienna 2012: WP8: Linked Open Data for Enterprise Data Web
 
LOD2 Plenary Vienna 2012: WP7 - Linked Open Data for Media and Publishing
LOD2 Plenary Vienna 2012: WP7 - Linked Open Data for Media and Publishing LOD2 Plenary Vienna 2012: WP7 - Linked Open Data for Media and Publishing
LOD2 Plenary Vienna 2012: WP7 - Linked Open Data for Media and Publishing
 
LOD2 Plenary Vienna 2012: WP6 - Interfaces, Integration & LOD2 Stack
LOD2 Plenary Vienna 2012: WP6 - Interfaces, Integration & LOD2 StackLOD2 Plenary Vienna 2012: WP6 - Interfaces, Integration & LOD2 Stack
LOD2 Plenary Vienna 2012: WP6 - Interfaces, Integration & LOD2 Stack
 
LOD2 Plenary Vienna 2012: WP5 - Linked Data Browsing, Visualization and Autho...
LOD2 Plenary Vienna 2012: WP5 - Linked Data Browsing, Visualization and Autho...LOD2 Plenary Vienna 2012: WP5 - Linked Data Browsing, Visualization and Autho...
LOD2 Plenary Vienna 2012: WP5 - Linked Data Browsing, Visualization and Autho...
 
LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion
LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge FusionLOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion
LOD2 Plenary Vienna 2012: WP4 - Reuse, Interlinking and Knowledge Fusion
 
LOD2 Plenary Vienna 2012: WP2 - Storing and Querying Very Large Knowledge Bases
LOD2 Plenary Vienna 2012: WP2 - Storing and Querying Very Large Knowledge BasesLOD2 Plenary Vienna 2012: WP2 - Storing and Querying Very Large Knowledge Bases
LOD2 Plenary Vienna 2012: WP2 - Storing and Querying Very Large Knowledge Bases
 
LOD2 Webinar Series: SILK
LOD2 Webinar Series: SILKLOD2 Webinar Series: SILK
LOD2 Webinar Series: SILK
 
LOD2 Webinar Series: OntoWiki
LOD2 Webinar Series: OntoWikiLOD2 Webinar Series: OntoWiki
LOD2 Webinar Series: OntoWiki
 
LOD2 Plenary Meeting 2011: Institute Mihajlo Pupin – Partner Introduction
LOD2 Plenary Meeting 2011: Institute Mihajlo Pupin – Partner IntroductionLOD2 Plenary Meeting 2011: Institute Mihajlo Pupin – Partner Introduction
LOD2 Plenary Meeting 2011: Institute Mihajlo Pupin – Partner Introduction
 

Último

IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019IES VE
 
ADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDE
ADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDEADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDE
ADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDELiveplex
 
Empowering Africa's Next Generation: The AI Leadership Blueprint
Empowering Africa's Next Generation: The AI Leadership BlueprintEmpowering Africa's Next Generation: The AI Leadership Blueprint
Empowering Africa's Next Generation: The AI Leadership BlueprintMahmoud Rabie
 
UiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation DevelopersUiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation DevelopersUiPathCommunity
 
Designing A Time bound resource download URL
Designing A Time bound resource download URLDesigning A Time bound resource download URL
Designing A Time bound resource download URLRuncy Oommen
 
Bird eye's view on Camunda open source ecosystem
Bird eye's view on Camunda open source ecosystemBird eye's view on Camunda open source ecosystem
Bird eye's view on Camunda open source ecosystemAsko Soukka
 
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAAnypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAshyamraj55
 
Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1DianaGray10
 
Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.YounusS2
 
Comparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioComparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioChristian Posta
 
Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxCybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxGDSC PJATK
 
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostKubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostMatt Ray
 
COMPUTER 10 Lesson 8 - Building a Website
COMPUTER 10 Lesson 8 - Building a WebsiteCOMPUTER 10 Lesson 8 - Building a Website
COMPUTER 10 Lesson 8 - Building a Websitedgelyza
 
Videogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdfVideogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdfinfogdgmi
 
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...Aggregage
 
AI Fame Rush Review – Virtual Influencer Creation In Just Minutes
AI Fame Rush Review – Virtual Influencer Creation In Just MinutesAI Fame Rush Review – Virtual Influencer Creation In Just Minutes
AI Fame Rush Review – Virtual Influencer Creation In Just MinutesMd Hossain Ali
 
Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024SkyPlanner
 

Último (20)

IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
IESVE Software for Florida Code Compliance Using ASHRAE 90.1-2019
 
ADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDE
ADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDEADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDE
ADOPTING WEB 3 FOR YOUR BUSINESS: A STEP-BY-STEP GUIDE
 
Empowering Africa's Next Generation: The AI Leadership Blueprint
Empowering Africa's Next Generation: The AI Leadership BlueprintEmpowering Africa's Next Generation: The AI Leadership Blueprint
Empowering Africa's Next Generation: The AI Leadership Blueprint
 
UiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation DevelopersUiPath Community: AI for UiPath Automation Developers
UiPath Community: AI for UiPath Automation Developers
 
Designing A Time bound resource download URL
Designing A Time bound resource download URLDesigning A Time bound resource download URL
Designing A Time bound resource download URL
 
Bird eye's view on Camunda open source ecosystem
Bird eye's view on Camunda open source ecosystemBird eye's view on Camunda open source ecosystem
Bird eye's view on Camunda open source ecosystem
 
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPAAnypoint Code Builder , Google Pub sub connector and MuleSoft RPA
Anypoint Code Builder , Google Pub sub connector and MuleSoft RPA
 
Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1Secure your environment with UiPath and CyberArk technologies - Session 1
Secure your environment with UiPath and CyberArk technologies - Session 1
 
Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.Basic Building Blocks of Internet of Things.
Basic Building Blocks of Internet of Things.
 
Comparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and IstioComparing Sidecar-less Service Mesh from Cilium and Istio
Comparing Sidecar-less Service Mesh from Cilium and Istio
 
Cybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptxCybersecurity Workshop #1.pptx
Cybersecurity Workshop #1.pptx
 
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCostKubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
KubeConEU24-Monitoring Kubernetes and Cloud Spend with OpenCost
 
COMPUTER 10 Lesson 8 - Building a Website
COMPUTER 10 Lesson 8 - Building a WebsiteCOMPUTER 10 Lesson 8 - Building a Website
COMPUTER 10 Lesson 8 - Building a Website
 
Videogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdfVideogame localization & technology_ how to enhance the power of translation.pdf
Videogame localization & technology_ how to enhance the power of translation.pdf
 
201610817 - edge part1
201610817 - edge part1201610817 - edge part1
201610817 - edge part1
 
20230104 - machine vision
20230104 - machine vision20230104 - machine vision
20230104 - machine vision
 
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
 
AI Fame Rush Review – Virtual Influencer Creation In Just Minutes
AI Fame Rush Review – Virtual Influencer Creation In Just MinutesAI Fame Rush Review – Virtual Influencer Creation In Just Minutes
AI Fame Rush Review – Virtual Influencer Creation In Just Minutes
 
20150722 - AGV
20150722 - AGV20150722 - AGV
20150722 - AGV
 
Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024Salesforce Miami User Group Event - 1st Quarter 2024
Salesforce Miami User Group Event - 1st Quarter 2024
 

LOD2 Webinar: SIREn

  • 1. Creating Knowledge out of Interlinked Data LOD2 Webinar . 24.06.2014 . Page 1 http://lod2.eu
  • 2. Creating Knowledge out of Interlinked Data http://lod2.eu LOD2 is a large-scale integrating project co-funded by the European Commission within the FP7 Information and Communication Technologies Work Programme. This 4-year project comprises leading Linked Open Data technology researchers, companies, and service providers. Coming from across 12 countries the partners are coordinated by the Agile Knowledge Engineering and Semantic Web Research Group at the University of Leipzig, Germany. LOD2 will integrate and syndicate Linked Data with existing large-scale applications. The project shows the benefits in the scenarios of Media and Publishing, Corporate Data intranets and eGovernment. LOD2 Webinar . 24.06.2014 . Page 2 http://lod2.eu
  • 3. Creating Knowledge out of Interlinked Data http://lod2.eu Once per month the LOD2 webinar series offer a free webinar about tools and services along the Linked Open Data Life Cycle. Stay with us and learn more about acquisition, editing, composing, connected applications – and finally publishing Linked Open Data. LOD2 Webinar . 24.06.2014 . Page 3 http://lod2.eu
  • 4. Creating Knowledge out of Interlinked Data Agenda • Nested Data Model • SIREn Overview • Getting Started with the SIREn Elasticsearch Plugin • Demo LOD2 Webinar . 24.06.2014 . Page 4 http://lod2.eu
  • 5. Creating Knowledge out of Interlinked Data Schema-Less Nested Data Model • Model becoming prevalent: JSON, XML, Avro, … – Can be arbitrarily nested and large – No strict schema / structure enforced • Schema-less brings – Flexibility – Ease of development • Developers do not have to invest significant modelling effort upfront LOD2 Webinar . 24.06.2014 . Page 5 http://lod2.eu
  • 6. Creating Knowledge out of Interlinked Data Introducing SIREn • Lucene, Solr and Elasticsearch plugin for indexing and searching JSON • Rich data model (JSON) – Nested objects, nested arrays, datatypes – Generic architecture compatible with various nested data models: JSON, JSON-LD, XML, Avro, ... • Schema-agnostic – SIREn does not require any schema definition to index and search data – Schema definition can change across records • Designed from the ground up for high performance and scalability LOD2 Webinar . 24.06.2014 . Page 6 http://lod2.eu
  • 7. Creating Knowledge out of Interlinked Data Introducing SIREn LOD2 Webinar . 24.06.2014 . Page 7 http://lod2.eu
  • 8. Creating Knowledge out of Interlinked Data Elasticsearch - Overview • Document-oriented search and analytics engine – JSON Document – Based on Apache Lucene • Distributed, Replication – High Performance and Availability • REST API LOD2 Webinar . 24.06.2014 . Page 8 http://lod2.eu
  • 9. Creating Knowledge out of Interlinked Data Elasticsearch – Basic Concepts • Index = Collection of Documents – Can have multiple shards and replicas • Type = A set of documents sharing the same schema – Similar to a DB table • Document = JSON object – Uniquely identified (index/type/id) – Similar to a DB record LOD2 Webinar . 24.06.2014 . Page 9 http://lod2.eu
  • 10. Creating Knowledge out of Interlinked Data SIREn – Basic Concepts • JSON object = Tree – Different mapping available • Node = An element of the tree – Can have a parent and one or more children – Contains data: text, numeric, boolean LOD2 Webinar . 24.06.2014 . Page 10 http://lod2.eu
  • 11. Creating Knowledge out of Interlinked Data SIREn – Basic Concepts JSON Object Tree { { "name" : "Elasticsearch", "category_code" : "search", "funding_rounds" : [ "name" : "Elasticsearch", "category_code" : "search", "funding_rounds" : [ { { "round_code" : "a", "raised_amount" : 10000000, "funded_year" : 2012, "investments" : [ "round_code" : "a", "raised_amount" : 10000000, "funded_year" : 2012, "investments" : [ { { "name" : "Data Collective", "type" : "financial-org" "name" : "Data Collective", "type" : "financial-org" }, … }, … ] ] }, … }, … ] ] } } name : Elasticsearch category_code : search funding_rounds round_code : a raised_amount : 10000000 funded_year : 2012 investments name : Data Collective type : financial-org investments name : ... LOD2 Webinar . 24.06.2014 . Page 11 http://lod2.eu
  • 12. Creating Knowledge out of Interlinked Data Getting Started with Elasticsearch & SIREn http://sirendb.com/downloads/ (Elasticsearch Distribution Coming Soon) LOD2 Webinar . 24.06.2014 . Page 12 http://lod2.eu
  • 13. Creating Knowledge out of Interlinked Data SIREn Elasticsearch Distribution ├── dist │ ├── siren-core-1.3.jar │ └── siren-qparser-1.3.jar ├── docs │ ├── apidoc │ └── siren-1.3-javadoc.jar ├── example ├── CHANGES.txt ├── LICENSE.md ├── README.md └── THIRD-PARTY.txt ├── dist │ ├── siren-core-1.3.jar │ └── siren-qparser-1.3.jar ├── docs │ ├── apidoc │ └── siren-1.3-javadoc.jar ├── example ├── CHANGES.txt ├── LICENSE.md ├── README.md └── THIRD-PARTY.txt SIREn Libraries SIREn Libraries SIREn Javadoc SIREn Javadoc Elasticsearch Distribution Elasticsearch Distribution LOD2 Webinar . 24.06.2014 . Page 13 http://lod2.eu
  • 14. Creating Knowledge out of Interlinked Data SIREn Elasticsearch Distribution ├── bin │ ├── elasticsearch │ └── ... ├── config │ ├── elasticsearch.yml │ └── logging.yml ├── lib │ ├── elasticsearch-1.2.0.jar │ └── ... ├── plugins │ └── siren-plugin │ ├── siren-elasticsearch-1.3.jar │ └── ... ├── datasets ├── NOTICE.txt └── README.md ├── bin │ ├── elasticsearch │ └── ... ├── config │ ├── elasticsearch.yml │ └── logging.yml ├── lib │ ├── elasticsearch-1.2.0.jar │ └── ... ├── plugins │ └── siren-plugin │ ├── siren-elasticsearch-1.3.jar │ └── ... ├── datasets ├── NOTICE.txt └── README.md EExxeeccuutatabbleless CCoonnfifgiguuraratitoionn SSIRIREEnn P Plulugginin LOD2 Webinar . 24.06.2014 . Page 14 http://lod2.eu
  • 15. Creating Knowledge out of Interlinked Data Starting Elasticsearch $ ./bin/elasticsearch $ ./bin/elasticsearch [INFO ][node ] [Valerie Cooper] version[1.2.0], pid[12842], … [INFO ][node ] [Valerie Cooper] initializing ... [INFO ][plugins] [Valerie Cooper] loaded [siren-plugin], sites [] [INFO ][node ] [Valerie Cooper] initialized [INFO ][node ] [Valerie Cooper] starting ... … [INFO ][node ] [Valerie Cooper] started [INFO ][node ] [Valerie Cooper] version[1.2.0], pid[12842], … [INFO ][node ] [Valerie Cooper] initializing ... [INFO ][plugins] [Valerie Cooper] loaded [siren-plugin], sites [] [INFO ][node ] [Valerie Cooper] initialized [INFO ][node ] [Valerie Cooper] starting ... … [INFO ][node ] [Valerie Cooper] started LOD2 Webinar . 24.06.2014 . Page 15 http://lod2.eu
  • 16. Creating Knowledge out of Interlinked Data Create an Index and Map SIREn’s Field $ curl -XPUT 'http://localhost:9200/test/' –d ' { $ curl -XPUT 'http://localhost:9200/test/' –d ' { "mappings" : { "mappings" : { "companies" : { "companies" : { "properties" : { "properties" : { SIREn’s Field "_siren_source" : { "index" : "analyzed", "analyzer" : "hybrid-json", "postings_format" : "Siren10AFor", "store" : "no", "type" : "string" } }, "_siren" : {} "_siren_source" : { "index" : "analyzed", "analyzer" : "hybrid-json", "postings_format" : "Siren10AFor", "store" : "no", "type" : "string" } }, "_siren" : {} } } } } }' }' SIREn’s Analyzer SIREn’s Posting Format LOD2 Webinar . 24.06.2014 . Page 16 http://lod2.eu
  • 17. Creating Knowledge out of Interlinked Data Indexing a Document $ curl -XPUT 'http://localhost:9200/test/companies/1' –d ' { $ curl -XPUT 'http://localhost:9200/test/companies/1' –d ' { "name" : "Elasticsearch", "category_code" : "search", "funding_rounds" : [ "name" : "Elasticsearch", "category_code" : "search", "funding_rounds" : [ { { "round_code" : "a", "raised_amount" : 10000000, "funded_year" : 2012, "investments" : [ "round_code" : "a", "raised_amount" : 10000000, "funded_year" : 2012, "investments" : [ { { "name" : "Data Collective", "type" : "financial-org" "name" : "Data Collective", "type" : "financial-org" } } ] ] } } ] ] }' }' LOD2 Webinar . 24.06.2014 . Page 17 http://lod2.eu
  • 18. Creating Knowledge out of Interlinked Data Customising Datatype Analysis • SIREn distinguishes 5 types of values in the JSON model: – Field, String, Long, Double, Boolean { { "name" : "Elasticsearch", "category_code" : "search", "funding_rounds" : [ "name" : "Elasticsearch", "category_code" : "search", "funding_rounds" : [ { { "round_code" : "a", "raised_amount" : 10000000, "funded_year" : 2012, "investments" : [ "round_code" : "a", "raised_amount" : 10000000, "funded_year" : 2012, "investments" : [ { { json:field "name" : "Data Collective", "type" : "financial-org" "name" : "Data Collective", "type" : "financial-org" } } ] ] } } ] ] } } xsd:long xsd:string LOD2 Webinar . 24.06.2014 . Page 18 http://lod2.eu
  • 19. Creating Knowledge out of Interlinked Data Customising Datatype Analysis # config/elasticsearch.yml # config/elasticsearch.yml siren.analysis.datatype: siren.analysis.datatype: http://json.org/field: http://json.org/field: index_analyzer: keyword index_analyzer: keyword http://www.w3.org/2001/XMLSchema#string: http://www.w3.org/2001/XMLSchema#string: index_analyzer: simple search_analyzer: simpleFolding index_analyzer: simple search_analyzer: simpleFolding http://www.w3.org/2001/XMLSchema#long: http://www.w3.org/2001/XMLSchema#long: index_analyzer: long index_analyzer: long index.analysis.analyzer: index.analysis.analyzer: simpleFolding: simpleFolding: type: custom tokenizer: letter filter: [lowercase, asciifolding] type: custom tokenizer: letter filter: [lowercase, asciifolding] LOD2 Webinar . 24.06.2014 . Page 19 http://lod2.eu
  • 20. Creating Knowledge out of Interlinked Data Searching: Node Query $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { "hybrid-json" : { "hybrid-json" : { "node" : { "node" : { "query" : "search", "query" : "search", } } } } }' }' name : Elasticsearch category_code : search funding_rounds round_code : a raised_amount : 10000000 funded_year : 2012 investments name : Data Collective type : financial-org investments name : ... SIREn’s Query Type SIREn’s Query Full-Text Query: •Boolean, Phrase, Range, Fuzzy, Regexp, Proximity, ... LOD2 Webinar . 24.06.2014 . Page 20 http://lod2.eu
  • 21. Creating Knowledge out of Interlinked Data Searching: Node Query $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { "hybrid-json" : { "hybrid-json" : { "node" : { "node" : { "query" : "search", "query" : "search", } } } } }' }' name : Elasticsearch category_code : search funding_rounds round_code : a raised_amount : 10000000 funded_year : 2012 investments name : Data Collective type : financial-org investments name : ... LOD2 Webinar . 24.06.2014 . Page 21 http://lod2.eu
  • 22. Creating Knowledge out of Interlinked Data Searching: Node Query $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { "hybrid-json" : { "hybrid-json" : { "node" : { "node" : { "query" : "search", "attribute" : "category_code", "query" : "search", "attribute" : "category_code", } } } } }' }' name : Elasticsearch category_code : search funding_rounds round_code : a raised_amount : 10000000 funded_year : 2012 investments name : Data Collective type : financial-org investments name : ... LOD2 Webinar . 24.06.2014 . Page 22 http://lod2.eu
  • 23. Creating Knowledge out of Interlinked Data Searching: Node Query $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { "hybrid-json" : { "hybrid-json" : { "node" : { "node" : { "query" : "search", "attribute" : "category_code", "query" : "search", "attribute" : "category_code", } } } } }' }' name : Elasticsearch category_code : search funding_rounds round_code : a raised_amount : 10000000 funded_year : 2012 investments name : Data Collective type : financial-org investments name : ... LOD2 Webinar . 24.06.2014 . Page 23 http://lod2.eu
  • 24. Creating Knowledge out of Interlinked Data Searching: Twig Query • Query operators for Ancestor-Descendant and Parent- Child relationships LOD2 Webinar . 24.06.2014 . Page 24 http://lod2.eu
  • 25. Creating Knowledge out of Interlinked Data Searching: Twig Query • Query operators for Ancestor-Descendant and Parent- Child relationships • Consists of a root query Boolean LOD2 Webinar . 24.06.2014 . Page 25 http://lod2.eu
  • 26. Creating Knowledge out of Interlinked Data Searching: Twig Query • Query operators for Ancestor-Descendant and Parent- Child relationships • Consists of a root query and one or more child Boolean Phrase MUST LOD2 Webinar . 24.06.2014 . Page 26 http://lod2.eu
  • 27. Creating Knowledge out of Interlinked Data Searching: Twig Query • Query operators for Ancestor-Descendant and Parent- Child relationships • Consists of a root query and one or more child and descendant queries Boolean Phrase MUST Boolean SHOULD LOD2 Webinar . 24.06.2014 . Page 27 http://lod2.eu
  • 28. Creating Knowledge out of Interlinked Data Searching: Twig Query • Query operators for Ancestor-Descendant and Parent- Child relationships • Consists of a root query and one or more child and descendant queries • Can be nested to form complex tree structure Boolean Phrase MUST Twig NOT Range MUST Boolean SHOULD LOD2 Webinar . 24.06.2014 . Page 28 http://lod2.eu
  • 29. Creating Knowledge out of Interlinked Data Searching: Twig Query $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { "hybrid-json" : { "hybrid-json" : { "twig" : { "twig" : { Twig Query Operator Root Query "root" : "funding_rounds" "child" : [{ "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" "query" : "2012", "attribute" : "funded_year" } } }] }] } } } } }' }' name : Elasticsearch category_code : search funding_rounds round_code : a raised_amount : 10000000 funded_year : 2012 investments name : Data Collective type : financial-org investments name : ... Child Query LOD2 Webinar . 24.06.2014 . Page 29 http://lod2.eu
  • 30. Creating Knowledge out of Interlinked Data Searching: Twig Query $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { "hybrid-json" : { "hybrid-json" : { "twig" : { "twig" : { "root" : "funding_rounds" "child" : [{ "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" "query" : "2012", "attribute" : "funded_year" } } }] }] } } } } }' }' name : Elasticsearch category_code : search funding_rounds round_code : a raised_amount : 10000000 funded_year : 2012 investments name : Data Collective type : financial-org investments name : ... LOD2 Webinar . 24.06.2014 . Page 30 http://lod2.eu
  • 31. Creating Knowledge out of Interlinked Data Searching: Twig Query $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { "hybrid-json" : { "hybrid-json" : { "twig" : { "twig" : { "root" : "funding_rounds" "child" : [{ "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" "query" : "2012", "attribute" : "funded_year" } } }] }] } } } } }' }' name : Elasticsearch category_code : search funding_rounds round_code : a raised_amount : 10000000 funded_year : 2012 investments name : Data Collective type : financial-org investments name : ... LOD2 Webinar . 24.06.2014 . Page 31 http://lod2.eu
  • 32. Creating Knowledge out of Interlinked Data Searching: Twig Query $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { "hybrid-json" : { "hybrid-json" : { "twig" : { "twig" : { "root" : "funding_rounds" "child" : [{ "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" "query" : "2012", "attribute" : "funded_year" } } }] }] } } } } }' }' name : Elasticsearch category_code : search funding_rounds round_code : a raised_amount : 10000000 funded_year : 2012 investments name : Data Collective type : financial-org investments name : ... LOD2 Webinar . 24.06.2014 . Page 32 http://lod2.eu
  • 33. Creating Knowledge out of Interlinked Data Searching: Twig Query $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d ' { "query" : { "hybrid-json" : { "hybrid-json" : { "twig" : { "twig" : { "root" : "funding_rounds" "child" : [{ "root" : "funding_rounds" "child" : [{ "occur" : "must", "node" : { "occur" : "must", "node" : { "query" : "2012", "attribute" : "funded_year" "query" : "2012", "attribute" : "funded_year" } } }] }] } } } } }' }' name : Elasticsearch category_code : search funding_rounds round_code : a raised_amount : 10000000 funded_year : 2012 investments name : Data Collective type : financial-org investments name : ... LOD2 Webinar . 24.06.2014 . Page 33 http://lod2.eu
  • 34. Creating Knowledge out of Interlinked Data Searching: Proximity Query $ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "boolean" : { "boolean" : { "inOrder" : true, "slop" : 0, "clause" : [{ "inOrder" : true, "slop" : 0, "clause" : [{ "twig" : { "twig" : { "root" : "funding_rounds", "descendant" : [{ Proximity Constraint "root" : "funding_rounds", "descendant" : [{ Order Constraint "level" : 2, "node" : { "query" : "Data AND Collective" } "level" : 2, "node" : { "query" : "Data AND Collective" } }] }] } } },{ },{ "twig" : { "twig" : { "root" : "funding_rounds", "descendant" : [{ "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Index AND Ventures" } "level" : 2, "node" : { "query" : "Index AND Ventures" } }] }] } } }] }] }}}}' }}}}' name : Elasticsearch category_code : search funding_rounds investments name : Data Collective type : financial-org funding_rounds investments name : Index Ventures type : financial-org 11 22 33 44 Boolean Clauses LOD2 Webinar . 24.06.2014 . Page 34 http://lod2.eu
  • 35. Creating Knowledge out of Interlinked Data Searching: Proximity Query $ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "boolean" : { "boolean" : { "inOrder" : true, "slop" : 0, "clause" : [{ "inOrder" : true, "slop" : 0, "clause" : [{ "twig" : { "twig" : { "root" : "funding_rounds", "descendant" : [{ "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Data AND Collective" } "level" : 2, "node" : { "query" : "Data AND Collective" } }] }] } } },{ },{ "twig" : { "twig" : { "root" : "funding_rounds", "descendant" : [{ "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Index AND Ventures" } "level" : 2, "node" : { "query" : "Index AND Ventures" } }] }] } } }] }] }}}}' }}}}' name : Elasticsearch category_code : search funding_rounds investments name : Data Collective type : financial-org funding_rounds investments name : Index Ventures type : financial-org 11 22 33 44 LOD2 Webinar . 24.06.2014 . Page 35 http://lod2.eu
  • 36. Creating Knowledge out of Interlinked Data Searching: Proximity Query $ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "boolean" : { "boolean" : { "inOrder" : true, "slop" : 0, "clause" : [{ "inOrder" : true, "slop" : 0, "clause" : [{ "twig" : { "twig" : { "root" : "funding_rounds", "descendant" : [{ "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Data AND Collective" } "level" : 2, "node" : { "query" : "Data AND Collective" } }] }] } } },{ },{ "twig" : { "twig" : { "root" : "funding_rounds", "descendant" : [{ "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Index AND Ventures" } "level" : 2, "node" : { "query" : "Index AND Ventures" } }] }] } } }] }] }}}}' }}}}' name : Elasticsearch category_code : search funding_rounds investments name : Data Collective type : financial-org funding_rounds investments name : Index Ventures type : financial-org 11 22 33 44 LOD2 Webinar . 24.06.2014 . Page 36 http://lod2.eu
  • 37. Creating Knowledge out of Interlinked Data Searching: Proximity Query $ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { $ curl -XGET 'localhost:9200/test/companies/_search' –d '{ "query" : { "hybrid-json" : { "boolean" : { "boolean" : { "inOrder" : true, "slop" : 0, "clause" : [{ "inOrder" : true, "slop" : 0, "clause" : [{ "twig" : { "twig" : { "root" : "funding_rounds", "descendant" : [{ "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Data AND Collective" } "level" : 2, "node" : { "query" : "Data AND Collective" } }] }] } } },{ },{ "twig" : { "twig" : { "root" : "funding_rounds", "descendant" : [{ "root" : "funding_rounds", "descendant" : [{ "level" : 2, "node" : { "query" : "Index AND Ventures" } "level" : 2, "node" : { "query" : "Index AND Ventures" } }] }] } } }] }] }}}}' }}}}' name : Elasticsearch category_code : search funding_rounds investments name : Data Collective type : financial-org funding_rounds investments name : Index Ventures type : financial-org 11 22 33 44 LOD2 Webinar . 24.06.2014 . Page 37 http://lod2.eu
  • 38. Creating Knowledge out of Interlinked Data Demo LOD2 Webinar . 24.06.2014 . Page 38 http://lod2.eu
  • 39. Creating Knowledge out of Interlinked Data Conclusion • SIREn’s Key Feature – Dynamic/Schema-less Data Management – Nested Data – High performance and scalability – Powerful search operators – Elasticsearch/Solr integration • Contact – http://sirendb.com – SindiceTech – info@sindicetech.com LOD2 Webinar . 24.06.2014 . Page 39 http://lod2.eu
  • 40. Creating Knowledge out of Interlinked Data Credits Jingle R.E.M., Martin Kaltenböck, Florian Kondert Coordination Thomas Thurner Martin Kaltenböck Moderation Martin Kaltenböck Presented by Renaud Delbru, Harish Kumar LOD2 Webinar . 24.06.2014 . Page 40 http://lod2.eu
  • 41. Creating Knowledge out of Interlinked Data http://lod2.eu Hope you enjoyed staying with us – if you need more detailed information, visit us at www.lod2.eu and let us know how we can improve to meet your expectations! Don’t forget to register for our next webinar 20.12. 2011 - Virtuoso (Open Link Software) 24.01. 2012 - OntoWiki (University of Leipzig, Germany) Have a great day and don’t forget ... LOD2 Webinar . 24.06.2014 . Page 41 http://lod2.eu
  • 42. Creating Knowledge out of Interlinked Data http://lod2.eu LOD2 Webinar . 24.06.2014 . Page 42 http://lod2.eu

Notas del editor

  1. Binary including full distribution of elasticsearch, with SIREn pre-installed.
  2. Full json document will be indexed both in elasticsearch, and SIREn Need to give an example here