2. The Storyline Ontology
• the problem space
• collaborative model development
• using it with news content
• prepared data for #newsHACK
3. •story telling is
fragmented into
articles
•links between
articles are handcurated by
journalists
•aggregations around
topics (indexes) are
hand-curated
7. 1. annotate assets with topics
Preferred
Label
CPS asset
CPS asset
preferred
Label
about
Slug
slug
mentions
CPS asset
etc
topic
short
Label
disambiguation
Hint
Short
Label
Disambiguation
Hint
8. 2. annotate assets with storylines
CPS asset
CPS asset
Title
about
about
title
Storyline
synopsis
CPS asset
about
Synopsis
9. 3. annotate assets with storylines
and storylines with topics
Title
CPS asset
CPS asset
Synopsis
title
about
about
synopsis
Storyline
topic
CPS asset
Person
about
topic
topic
topic
Organisation
Theme
Place
12. #newsHack
• APIs are ready
• storylines and events have been made
• (tell us if you want more)
• content from multiple sources
Notas del editor
8 minutes on the storyline ontology
explain the problem space – how and did we come up with the storyline model
look at applying the ontology to news content, starting simple then moving to more complex storylines
finally point you at the data we have prepared, and explain how we can make more for you if needed
BBC News publishes ~500 articles + other content types every day
manually maintained topic aggregations (indexes)
manually maintained links between articles
labour intensive
repetitive (much of an article will be repetition - 'clone' in CPS
the story becomes fragmented - many articles and other content types to one storyline
instead why not aggregate content as developments of updates in a storyline
collaboration - a broadcaster, a newspaper, a wire service, a search engine
ran workshops, shared ideas, tested on real data
produced a model, published it under a CC-SA license
most of these guys are here today!
here’s v0.1 the model in it’s entirety
best understood through it’s application to content
take you through four ways the model can be used
1 – basic annotation of content with people, places, orgs, etc
allows dynamic aggregation by topic
BBC already doing this for sport – recent articles about Man Utd
others – Guardian?
2 - as before, but here we have abstracted the Storyline as a concept
‘content about Edward Snowden’s assylum bid’
some News orgs already doing this, eg itv.com/news
3 – taking this firther we can annotate a Storyline with People, Places, Orgs and Themes
allows for more accurate statements about content – not about Chine, but about an event that took place in China
makes for useful onward journeys if we expose these 2-step annotations as links
4 - a Storyline can have Slots
Slots contain Events
Slots can be related to each other (e.g. ‘follows’) to build narrative sequences of Events, or Storylines
this is the space we really want to explore over the NewsHack event
over the past few weeks we have been seeding the BBC News Prototyping platform (Juicer) with these complex (level 4) storylines
we’ve also annotated (‘tagged’) articles with events in these storylines
list of the storylines we have created here:
if there’s a story you love to build an app for that’s not here then tweet us with the #newsHack hash tag and myself or one of the other data monkeys will make it for you