Paul Brock built a search system for scientists at Janseen Scientific Affairs, LLC, subsidiary of Johnson & Johnson. Despite what you’d think by the famous company name, he did not have deep pockets and a large budget, he said at the first Philadelphia Semantic Web Meetup in two years.
Brock, associate director of search solutions for Janseen Scientific Affairs, was able to get a good system up and running by focusing on the things that would deliver the most impact. For example, he created facets for quick searching (similar to what are on most major shopping websites, Zappos being one). The vast majority of his system users make use of the facets for searching. (A technical detail: the facets are stored as a taxonomy in Protégé.)
During the meetup, attendees also heard from Mike Grove, Chief Software Architect for Washington, D.C.-based semantic web company Clark & Parsia.
He discussed a business-motivating aspect of the RDF database Stardog that he built. Instead of encoding business rules in programming code as is normally required, the rules can be encoded separately from the codebase. Since non-programmers can be involved in encoding the rules, programmers and the IT department are no longer a bottleneck in development of the product. In addition, actual experts in business logic can implement the business rules rather than programmers.
Here are some more highlights from the SmartData: Enterprise Semantics and Search meetup:
What is Smart Data vs Big Data?
There’s been a lot of talk about “Big Data.” So what is “Smart Data”? Adding more data isn’t the solution to all problems. Getting value from data comes down to how easily you can do something with it. Smart data is data with semantics attached. Semantics gives data meaning. More specifically it creates data with computer understandable meaning. This means the computer can help. This makes it easier to use the data. Analysis, business intelligence, decision support, etc are all made easier.
What is Stardog?
Stardog is the leading RDF graph database that has semantic search capabilities built-in. Stardog has come a long way in the past few years, like many fast-moving tech solutions. It’s still easy to set up and get working with it. It’s posted some impressive performance gains and now has built-in reasoning capabilities.
Use Cases for semantic reasoning in the enterprise:
Best Buy: Stardog powers Best Buy’s “Like for Like” service (given a product SKU, the most similar products are returned). Stardog also powers searches of the Best Buy product catalog. It’s also used for seasonal promotions searches, like the 2013 Stocking Stuffer promo.
SAIC: Security policies are represented in XACML. These are automatically converted to OWL. Once converted to OWL, the rules are loaded into Stardog and semantic reasoning is used to determine if any inconsistencies exist in the policies. Do the policies do what you think they do? Do any policies allow actions that other policies deny? Are there any policies that are redundant?
NASA: Via Stardog, semantic reasoning infers new connections in the data. This allows them to search existing data to extract information that had previously not been findable. This new search capability saved NASA $38 million a year. It is an official W3C case study for the use of semantic technologies.
History of this type of information science problems/ solutions:
We also got a history lesson on how old some of these information problems and solutions are. For example facet categorization has been around as a concept since the 1930s. It was not a popular way of categorizing data at the time, but in modern computing of course it is used extensively.
For example, most shopping websites use facets as a way for customers to quickly navigate the inventory. These facets will often be represented by a taxonomy or ontology. Once in an ontology, semantic reasoning can be applied to establish any inconsistencies in the ontology and to derive new information.
Small things you can do to improve search results:
There are many pieces of metadata that might seem unimportant, but if known and used will improve search results. For example, knowing authorship of a document can be used to calculate relevance and importance of a document for ranking results.
Before you go...
Please consider supporting Technical.ly to keep our independent journalism strong. Unlike most business-focused media outlets, we don’t have a paywall. Instead, we count on your personal and organizational support.
3 ways to support our work:- Contribute to the Journalism Fund. Charitable giving ensures our information remains free and accessible for residents to discover workforce programs and entrepreneurship pathways. This includes philanthropic grants and individual tax-deductible donations from readers like you.
- Use our Preferred Partners. Our directory of vetted providers offers high-quality recommendations for services our readers need, and each referral supports our journalism.
- Use our services. If you need entrepreneurs and tech leaders to buy your services, are seeking technologists to hire or want more professionals to know about your ecosystem, Technical.ly has the biggest and most engaged audience in the mid-Atlantic. We help companies tell their stories and answer big questions to meet and serve our community.
Join our growing Slack community
Join 5,000 tech professionals and entrepreneurs in our community Slack today!