Unleashing Unconstrained News Knowledge Graphs to Combat Misinformation // Robert Caulk // #279
Robert Caulk is responsible for directing software development, enabling research, coordinating company projects, quality control, proposing external collaborations, and securing funding. He believes firmly in open-source, having spent 12 years accruing over 1000 academic citations building open-source software in domains such as machine learning, image analysis, and coupled physical processes. He received his Ph.D. from Université Grenoble Alpes, France, in computational mechanics.
Unleashing Unconstrained News Knowledge Graphs to Combat Misinformation // MLOps Podcast #279 with Robert Caulk, Founder of Emergent Methods.
// Abstract
Indexing hundreds of thousands of news articles per day into a knowledge graph (KG) was previously impossible due to the strict requirement that high-level reasoning, general world knowledge, and full-text context *must* be present for proper KG construction.
The latest tools now enable such general world knowledge and reasoning to be applied cost effectively to high-volumes of news articles. Beyond the low cost of processing these news articles, these tools are also opening up a new, controversial, approach to KG building - unconstrained KGs.
We discuss the construction and exploration of the largest news-knowledge-graph on the planet - hosted on an endpoint at AskNews.app. During talk we aim to highlight some of the sacrifices and benefits that go hand-in-hand with using the infamous unconstrained KG approach.
We conclude the talk by explaining how knowledge graphs like these help to mitigate misinformation. We provide some examples of how our clients are using this graph, such as generating sports forecasts, generating better social media posts, generating regional security alerts, and combating human trafficking.
// Bio
Robert is the founder of Emergent Methods, where he directs research and software development for large-scale applications. He is currently overseeing the structuring of hundreds of thousands of news articles per day in order to build the best news retrieval API in the world: https://asknews.app.
// MLOps Swag/Merch
https://shop.mlops.community/
// Related Links
Website: https://emergentmethods.ai
News Retrieval API: https://asknews.app
--------------- ✌️Connect With Us ✌️ -------------
Join our slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, blogs, newsletters, and more: https://mlops.community/
Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Rob on LinkedIn: https://www.linkedin.com/in/rcaulk/
Timestamps:
[00:00] Rob's preferred coffee
[00:05] Takeaways
[00:55] Please like, share, leave a review, and subscribe to our MLOps channels!
[01:00] Join our Local Organizer Carousel!
[02:15] Knowledge Graphs and ontology
[07:43] Ontology vs Noun Approach
[12:46] Ephemeral tools for efficiency
[17:26] Oracle to PostgreSQL migration
[22:20] MEM Graph life cycle
[29:14] Knowledge Graph Investigation Insights
[33:37] Fine-tuning and distillation of LLMs
[39:28] DAG workflow and quality control
[46:23] Crawling nodes with Phi 3 Llama
[50:05] AI pricing risks and strategies
[56:14] Data labeling and poisoning
[58:34] API costs vs News latency
[1:02:10] Product focus and value
[1:04:52] Ensuring reliable information
[1:11:01] Podcast transcripts as News
[1:13:08] Ontology trade-offs explained
[1:15:00] Wrap up