Thursday, May 26, 2016

You ignore the work on structured stories at your own risk

If you are a journalist (or PR writer), you should read David Caswell's latest about his research into structured stories and follow the links, especially to the structured stories database.

Here's a definition from one of Caswell's earlier atricles
Structured Stories is a form of structured journalism, an approach in which reporting is entered directly into a database and then extracted as needed to create digital news products. Early examples of structured journalism, such as PolitiFact and Homicide Watch D.C., are limited to fixed news items in narrowly defined subject areas. Structured Stories, however, attempts to encode any journalistic news — from any subject area — into structured events and narratives.

Articles are not obsolete in a structured journalism approach but instead are organized within much larger journalistic structures that provide context, coherence and flexibility. These narrative structures are then used to make news stories intelligible to computers and, therefore, available for a variety of digital applications.
This is a not-so-nascent-anymore corner of journalism thought that says, essentially, that the beautifully crafted narrative you just wrote is nothing but data. That fire story you wrote? The address is data, so is the amount of damage, the cause, the type of house, etc. And it's data insurers and others might be willing to pay for.

Adrian Holovaty famously proposed this disaggregation of journalistic stories in 2006. Matt Waite extended it seven years later.
 
Caswell has now been testing the idea on an operational scale in New York, Los Angeles and Missouri (Where he's a Reynolds Fellow).
 
You risk not paying attention to this at your own peril. 

Let me put it this way, how many of you scoffed not that long ago at the concept of computers "writing" journalism stories? How's that worked out?  (A search on "Automated Insights" will fill you in a bit more, such as the AP's wide use of the software, if you haven't been paying attention. Here's some more on AP.)

Now, if you go to Caswell's Structured Stories site and look at some of the work, to those steeped in "storytelling," the examples don't look like much. Bullet points, cards, etc. Certainly not an eye-pleasing "story" (and, let's be honest, we misuse that term a lot anyhow; much of what we do is factual exposition, not story).

But what you are seeing is Holovaty's vision beginning to be turned into reality. And here's why it's important:
  • When you break stories into data, you can repackage that data in many ways and resell it, meaning more streams of income in an era when that's guaranteed to get executives' attention (after all, what other business do you know that leaves more than half of its raw material on the shop floor?).
  • This inevitably means changes in workflows, training and, perhaps, the romantic notion of the storytelling journalist. From another of Caswell's articles: Working with structured information enables the journalist to become like an air traffic controller for news: coordinating, routing, verifying and organizing news as well as identifying gaps in knowledge and filling them by assigning journalistic resources to conduct original reporting. This level of coordination is an impossible, even meaningless, task in a media environment based on text articles, but in a structured media environment it becomes easy and valuable.
  • Caswell says he's shown in real operational situations that structured journalism can be done. 
  • Finally, if you scoff that "people will never read this stuff," I want you to think about two things. First, much of this is not designed to be read by people; it is designed to be read and repurposed by machines. Second, go to the top of any one of those story databases on the Structured Stories site and click on the "told as" drop down menu. Go down to "natural language."  I have not found a natural language version yet, though I haven't gone through all the items,. But take a close look, and what does it say? "Natural language generation by Automated Insights." Uh huh.
Oh, and there's one other. In an era where consumers more and more are accustomed to blending their information and media inputs as they wish, this is the ultimate way for them to do that with journalism.
 
In many ways, this is another aspect of the "semantic web," essentially, the idea of turning much, if not all, of the web into data. Projects such as Open Calais have been grniding along for several years, trying to figure out how to turn all this unstructured information (in our business = story) into structured data.

So whether the prospect makes you recoil or not, it's time to start paying attention. Your future in journalism (and PR) probably depends on it.


Labels: , , , , , , ,

SC legslators suggest Wikipedia will do over those expensive databases

(Update: 1:40 p.m. 3/26: Ron Aiken of The Nerve says the language was stricken in conference committee last night but that the sponsor, Rep. Leon Stavrinakis, D-Charleston, says he'll bring it up again next session.)

S.C.'s State Library apparently is up in arms about some legislative budget engineering that would put some requirements and restrictions on the statewide DISCUS system, the free digital library available to everyone in the state and probably one of the state's best (if somewhat hidden) resources.

First there was a House budget proviso that would have prohibited the library from licensing electronic sources "where the same information is easily found in free online products such as Wikipedia." (Oh, there's a reliable source, eh?) It also would have prohibited licensing databases of articles "from mainstream newspapers and magazines, as these can almost always be accessed free online and are easily discovered through Internet search engines."

That same proviso also would have prohibited the inclusion of scholarly articles as not "intellectually accessible to the general population," but that was stricken -- as was the whole proviso.

But now the House has amended the Senate version to insert a new proviso that says no database DISCUS buys can have more than 20% of material freely available online.

There also are a bunch of technical requirements, such as that all databases must have responsive design that allows them to be viewable "down to the smallest smartphone size" and that there be an extensive geolocation service for all users. Video would also have to be delivered as H.264, MPEG-4 AVC format.

So in theory the responsive design requirement is a good one -- but will that put valuable databases/info off limits?

If you are out of state (or even on the border and your cell signal is being picked up by a tower in Georgia or NC) does that mean no access?

Sure, H.264 AVC is the advanced standard now, but things don't change much in tech, do they? So how quickly, if this is specified in state law, will it become outdated?

Generally, the success of legislating specific technology requirements has not gone well through the years.

To see the State Library's take on all this and the source docs:

Labels: , ,

Monday, May 02, 2016

Wishful thinking, newspaper edition

From the wishful thinking dept. at Editor and Publisher.:
Returning to print shouldn’t be seen as taking a step back. Many readers still rely heavily on the print edition. A Pew Research Center study found that around half of newspaper readers in three U.S. metropolitan cities (Denver, Colo., Macon, Ga., and Sioux City, Iowa) only read in print.

With the saturation of news, the toxicity of online harassment, and the amount of poor Web experiences, readers will soon want to come back to print. This resurgence must take place if we want to keep print around for many more years, and publishers can accomplish that by immersing readers—not with virtual reality headsets—but with ink on their hands.
That's at the bottom of a mishmash, way-too-long piece that tries to make the case that poor woeful newspapers are being victimized again by technology, this time ad blockers (BTW, there's an easy way to get around Forbes' ad-blocking message and many other publishers').

That Pew statistic? It's a nice way of deception. Remember, it says half of all newspaper readers. It doesn't say what's happening to the overall number of newspaper readers (in other words, if there are still two newspaper readers and one reads only in print, you've met that stat -- but it's hardly a viable business model).

I'm a big fan of "newspapers" if you mean the term to refer to robust news orgs. If you mean it to refer to ink on paper, however, I'd like to introduce you to the dozens of students I interact with every semester. You know, the future higher income, higher educated readers your advertisers want. "Newspaper" is not in their daily universe.

This, of course, from the same people who have been telling themselves for years that as people age and buy houses, have kids, etc., they'll start reading newspapers -- despite every bit of solid social science research that's debunked that.

So how's that working out?

Labels: , , ,