<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Edgar Meij</title>
	<atom:link href="https://edgar.meij.pro/feed/" rel="self" type="application/rss+xml" />
	<link>https://edgar.meij.pro/</link>
	<description>semantic search research ッ</description>
	<lastBuildDate>Fri, 08 Sep 2023 08:49:36 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	

<image>
	<url>https://edgar.meij.pro/wp-content/uploads/2020/06/226193-512-144x144.jpeg</url>
	<title>Edgar Meij</title>
	<link>https://edgar.meij.pro/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Dense Retrieval Adaptation using Target Domain Description (ICTIR 2023)</title>
		<link>https://edgar.meij.pro/dense-retrieval-adaptation-target-domain-description-ictir-2023/</link>
					<comments>https://edgar.meij.pro/dense-retrieval-adaptation-target-domain-description-ictir-2023/#respond</comments>
		
		<dc:creator><![CDATA[Edgar Meij]]></dc:creator>
		<pubDate>Sun, 23 Jul 2023 08:46:39 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Conference Papers]]></category>
		<category><![CDATA[Publications]]></category>
		<guid isPermaLink="false">https://edgar.meij.pro/?p=1509</guid>

					<description><![CDATA[<p>In information retrieval (IR), domain adaptation is the process of adapting a retrieval model to a new domain whose data distribution is different from the source domain. Existing methods in this area focus on unsupervised domain adaptation where they have access to the target document collection or supervised (often few-shot)&#8230;</p>
<p>The post <a href="https://edgar.meij.pro/dense-retrieval-adaptation-target-domain-description-ictir-2023/">Dense Retrieval Adaptation using Target Domain Description (ICTIR 2023)</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>In information retrieval (IR), domain adaptation is the process of<br />
adapting a retrieval model to a new domain whose data distribution<br />
is different from the source domain. Existing methods in this area<br />
focus on unsupervised domain adaptation where they have access<br />
to the target document collection or supervised (often few-shot)<br />
domain adaptation where they additionally have access to (limited)<br />
labeled data in the target domain. There also exists research on<br />
improving zero-shot performance of retrieval models with no adaptation. This paper introduces a new category of domain adaptation<br />
in IR that is as-yet unexplored. Here, similar to the zero-shot setting, we assume the retrieval model does not have access to the<br />
target document collection. In contrast, it does have access to a<br />
brief textual description that explains the target domain. We define<br />
a taxonomy of domain attributes in retrieval tasks to understand<br />
different properties of a source domain that can be adapted to a<br />
target domain. We introduce a novel automatic data construction<br />
pipeline that produces a synthetic document collection, query set,<br />
and pseudo relevance labels, given a textual domain description.<br />
Extensive experiments on five diverse target domains show that<br />
adapting dense retrieval models using the constructed synthetic<br />
data leads to effective retrieval performance on the target domain.</p>
<p>The post <a href="https://edgar.meij.pro/dense-retrieval-adaptation-target-domain-description-ictir-2023/">Dense Retrieval Adaptation using Target Domain Description (ICTIR 2023)</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://edgar.meij.pro/dense-retrieval-adaptation-target-domain-description-ictir-2023/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>ECIR 23 Tutorial: Neuro-Symbolic Approaches for Information Retrieval</title>
		<link>https://edgar.meij.pro/ecir-23-tutorial-neuro-symbolic-approaches-information-retrieval/</link>
					<comments>https://edgar.meij.pro/ecir-23-tutorial-neuro-symbolic-approaches-information-retrieval/#respond</comments>
		
		<dc:creator><![CDATA[Edgar Meij]]></dc:creator>
		<pubDate>Wed, 08 Mar 2023 09:45:18 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Publications]]></category>
		<category><![CDATA[Tutorial]]></category>
		<guid isPermaLink="false">https://edgar.meij.pro/?p=1507</guid>

					<description><![CDATA[<p>This tutorial will provide an overview of recent advances on neuro-symbolic approaches for information retrieval. A decade ago, knowledge graphs and semantic annotations technology led to active research on how to best leverage symbolic knowledge. At the same time, neural methods have demonstrated to be versatile and highly effective. From&#8230;</p>
<p>The post <a href="https://edgar.meij.pro/ecir-23-tutorial-neuro-symbolic-approaches-information-retrieval/">ECIR 23 Tutorial: Neuro-Symbolic Approaches for Information Retrieval</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>This tutorial will provide an overview of recent advances<br />
on neuro-symbolic approaches for information retrieval. A decade ago,<br />
knowledge graphs and semantic annotations technology led to active research on how to best leverage symbolic knowledge. At the same time,<br />
neural methods have demonstrated to be versatile and highly effective.<br />
From a neural network perspective, the same representation approach<br />
can service document ranking or knowledge graph reasoning. End-to-end<br />
training allows to optimize complex methods for downstream tasks.<br />
We are at the point where both the symbolic and the neural research<br />
advances are coalescing into neuro-symbolic approaches. The underlying<br />
research questions are how to best combine symbolic and neural approaches, what kind of symbolic/neural approaches are most suitable for<br />
which use case, and how to best integrate both ideas to advance the state<br />
of the art in information retrieval.</p>
<p>The post <a href="https://edgar.meij.pro/ecir-23-tutorial-neuro-symbolic-approaches-information-retrieval/">ECIR 23 Tutorial: Neuro-Symbolic Approaches for Information Retrieval</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://edgar.meij.pro/ecir-23-tutorial-neuro-symbolic-approaches-information-retrieval/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Entity Retrieval from Multilingual Knowledge Graphs (MRL 2022)</title>
		<link>https://edgar.meij.pro/entity-retrieval-multilingual-knowledge-graphs-mrl-2022/</link>
					<comments>https://edgar.meij.pro/entity-retrieval-multilingual-knowledge-graphs-mrl-2022/#respond</comments>
		
		<dc:creator><![CDATA[Edgar Meij]]></dc:creator>
		<pubDate>Thu, 08 Dec 2022 09:44:01 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Conference Papers]]></category>
		<category><![CDATA[Publications]]></category>
		<guid isPermaLink="false">https://edgar.meij.pro/?p=1505</guid>

					<description><![CDATA[<p>Knowledge Graphs (KGs) are structured databases that capture real-world entities and their relationships. The task of entity retrieval from a KG aims at retrieving a ranked list of entities relevant to a given user query. While English-only entity retrieval has attracted considerable attention, user queries, as well as the information&#8230;</p>
<p>The post <a href="https://edgar.meij.pro/entity-retrieval-multilingual-knowledge-graphs-mrl-2022/">Entity Retrieval from Multilingual Knowledge Graphs (MRL 2022)</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Knowledge Graphs (KGs) are structured databases that capture real-world entities and their relationships. The task of entity retrieval from a KG aims at retrieving a ranked list of entities relevant to a given user query. While English-only entity retrieval has attracted considerable attention, user queries, as well as the information contained in the KG, may be represented in multiple—and possibly distinct—languages. Furthermore, KG content may vary between languages due to different information sources and points of view. Recent advances in language representation have enabled natural ways of bridging gaps between languages. In this paper, we therefore propose to utilise language models (LMs) and diverse entity representations to enable truly multilingual entity retrieval. We propose two approaches:(i) an array of monolingual retrievers and (ii) a single multilingual retriever, trained using queries and documents in multiple languages. We show that while our approach is on par with the significantly more complex state-of-the-art method for the English task, it can be successfully applied to virtually any language with a LM. Furthermore, it allows languages to benefit from one another, yielding significantly better performance, both for low-and high-resource languages.</p>
<p>The post <a href="https://edgar.meij.pro/entity-retrieval-multilingual-knowledge-graphs-mrl-2022/">Entity Retrieval from Multilingual Knowledge Graphs (MRL 2022)</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://edgar.meij.pro/entity-retrieval-multilingual-knowledge-graphs-mrl-2022/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Similarity-based Multi-Domain Dialogue State Tracking with Copy Mechanisms for Task-based Virtual Personal Assistants (WWW 2022)</title>
		<link>https://edgar.meij.pro/similarity-based-multi-domain-dialogue-state-tracking-copy-mechanisms-task-based-virtual-personal-assistants-www-2022/</link>
					<comments>https://edgar.meij.pro/similarity-based-multi-domain-dialogue-state-tracking-copy-mechanisms-task-based-virtual-personal-assistants-www-2022/#respond</comments>
		
		<dc:creator><![CDATA[Edgar Meij]]></dc:creator>
		<pubDate>Sat, 09 Apr 2022 08:43:28 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Conference Papers]]></category>
		<category><![CDATA[Publications]]></category>
		<guid isPermaLink="false">https://edgar.meij.pro/?p=1503</guid>

					<description><![CDATA[<p>Task-based Virtual Personal Assistants (VPAs) rely on multi-domain Dialogue State Tracking (DST) models to monitor goals throughout a conversation. Previously proposed models show promising results on established benchmarks, but they have difficulty adapting to unseen domains due to domain-specific parameters in their model architectures. We propose a new Similarity-based Multi-domain&#8230;</p>
<p>The post <a href="https://edgar.meij.pro/similarity-based-multi-domain-dialogue-state-tracking-copy-mechanisms-task-based-virtual-personal-assistants-www-2022/">Similarity-based Multi-Domain Dialogue State Tracking with Copy Mechanisms for Task-based Virtual Personal Assistants (WWW 2022)</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Task-based Virtual Personal Assistants (VPAs) rely on multi-domain<br />
Dialogue State Tracking (DST) models to monitor goals throughout<br />
a conversation. Previously proposed models show promising results<br />
on established benchmarks, but they have difficulty adapting to<br />
unseen domains due to domain-specific parameters in their model<br />
architectures. We propose a new Similarity-based Multi-domain Dialogue State Tracking model (SM-DST) that uses retrieval-inspired<br />
and fine-grained contextual token-level similarity approaches to<br />
efficiently and effectively track dialogue state. The key difference<br />
with state-of-the-art DST models is that SM-DST has a single model<br />
with shared parameters across domains and slots. Because we base<br />
SM-DST on similarity it allows the transfer of tracking information between semantically related domains as well as to unseen<br />
domains without retraining. Furthermore, we leverage copy mechanisms that consider the system’s response and the dialogue state<br />
from previous turn predictions, allowing it to more effectively track<br />
dialogue state for complex conversations. We evaluate SM-DST<br />
on three variants of the MultiWOZ DST benchmark datasets. The<br />
results demonstrate that SM-DST significantly and consistently<br />
outperforms state-of-the-art models across all datasets by absolute<br />
5-18% and 3-25% in the few- and zero-shot settings, respectively.</p>
<p>The post <a href="https://edgar.meij.pro/similarity-based-multi-domain-dialogue-state-tracking-copy-mechanisms-task-based-virtual-personal-assistants-www-2022/">Similarity-based Multi-Domain Dialogue State Tracking with Copy Mechanisms for Task-based Virtual Personal Assistants (WWW 2022)</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://edgar.meij.pro/similarity-based-multi-domain-dialogue-state-tracking-copy-mechanisms-task-based-virtual-personal-assistants-www-2022/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Understanding Financial Information Seeking Behavior from User Interactions with Company Filings (WWW companion 2022)</title>
		<link>https://edgar.meij.pro/understanding-financial-information-seeking-behavior-user-interactions-company-filings-www-companion-2022/</link>
					<comments>https://edgar.meij.pro/understanding-financial-information-seeking-behavior-user-interactions-company-filings-www-companion-2022/#respond</comments>
		
		<dc:creator><![CDATA[Edgar Meij]]></dc:creator>
		<pubDate>Tue, 05 Apr 2022 08:42:39 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Conference Papers]]></category>
		<category><![CDATA[Publications]]></category>
		<guid isPermaLink="false">https://edgar.meij.pro/?p=1501</guid>

					<description><![CDATA[<p>Publicly-traded companies are required to regularly file financial statements and disclosures. Analysts, investors, and regulators leverage these filings to support decision making, with high financial and legal stakes. Despite their ubiquity in finance, little is known about the information seeking behavior of users accessing such filings. In this work, we&#8230;</p>
<p>The post <a href="https://edgar.meij.pro/understanding-financial-information-seeking-behavior-user-interactions-company-filings-www-companion-2022/">Understanding Financial Information Seeking Behavior from User Interactions with Company Filings (WWW companion 2022)</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Publicly-traded companies are required to regularly file financial<br />
statements and disclosures. Analysts, investors, and regulators<br />
leverage these filings to support decision making, with high financial and legal stakes. Despite their ubiquity in finance, little is<br />
known about the information seeking behavior of users accessing<br />
such filings. In this work, we present the first study of this behavior.<br />
We analyze 14 years of logs of users accessing company filings<br />
of more than 600K distinct companies on the U.S. Securities and<br />
Exchange Commission’s (SEC) Electronic Data Gathering, Analysis,<br />
and Retrieval (EDGAR) system, the primary resource for accessing<br />
company filings. We provide an analysis of the information-seeking<br />
behavior for this high-impact domain. We find that little behavioral history is available for the majority of users, while frequent<br />
users have rich histories. Most sessions focus on filings belonging<br />
to a small number of companies, and individual users are interested in a limited number of companies. Out of all sessions, 66%<br />
contain filings from one or two companies and 50% of frequent<br />
users are interested in six companies or less. Understanding user<br />
interactions with EDGAR can suggest ways to enhance the user<br />
journey in browsing filings, e.g., via filing recommendation. Our<br />
work provides a stepping stone for the academic community to<br />
tackle retrieval and recommendation tasks for the finance domain.</p>
<p>The post <a href="https://edgar.meij.pro/understanding-financial-information-seeking-behavior-user-interactions-company-filings-www-companion-2022/">Understanding Financial Information Seeking Behavior from User Interactions with Company Filings (WWW companion 2022)</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://edgar.meij.pro/understanding-financial-information-seeking-behavior-user-interactions-company-filings-www-companion-2022/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Recent papers</title>
		<link>https://edgar.meij.pro/recent-papers/</link>
					<comments>https://edgar.meij.pro/recent-papers/#respond</comments>
		
		<dc:creator><![CDATA[Edgar Meij]]></dc:creator>
		<pubDate>Sun, 02 Jan 2022 17:47:00 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Conference Papers]]></category>
		<category><![CDATA[Journal Papers]]></category>
		<category><![CDATA[Publications]]></category>
		<category><![CDATA[Workshop Papers]]></category>
		<guid isPermaLink="false">https://edgar.meij.pro/?p=1476</guid>

					<description><![CDATA[<p>Please look at Google Scholar to see the list of my most recept up-to-date papers, and be sure to check out https://techatbloomberg.com/ai as well!</p>
<p>The post <a href="https://edgar.meij.pro/recent-papers/">Recent papers</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>Please look at <a href="https://scholar.google.com/citations?hl=en&amp;user=vb6YKF0AAAAJ&amp;view_op=list_works&amp;sortby=pubdate">Google Scholar</a> to see the list of my most recept up-to-date papers, and be sure to check out <a href="https://techatbloomberg.com/ai">https://techatbloomberg.com/ai</a> as well!</p>
<p>The post <a href="https://edgar.meij.pro/recent-papers/">Recent papers</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://edgar.meij.pro/recent-papers/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Improving Dialogue State Tracking with Turn-based Loss Function and Sequential Data Augmentation (EMNLP 2021)</title>
		<link>https://edgar.meij.pro/improving-dialogue-state-tracking-turn-based-loss-function-sequential-data-augmentation-emnlp-2021/</link>
					<comments>https://edgar.meij.pro/improving-dialogue-state-tracking-turn-based-loss-function-sequential-data-augmentation-emnlp-2021/#respond</comments>
		
		<dc:creator><![CDATA[Edgar Meij]]></dc:creator>
		<pubDate>Mon, 08 Nov 2021 09:41:12 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Conference Papers]]></category>
		<category><![CDATA[Publications]]></category>
		<guid isPermaLink="false">https://edgar.meij.pro/?p=1499</guid>

					<description><![CDATA[<p>While state-of-the-art Dialogue State Tracking (DST) models show promising results, all of them rely on a traditional cross-entropy loss function during the training process, which may not be optimal for improving the joint goal accuracy. Although several approaches recently propose augmenting the training set by copying user utterances and replacing&#8230;</p>
<p>The post <a href="https://edgar.meij.pro/improving-dialogue-state-tracking-turn-based-loss-function-sequential-data-augmentation-emnlp-2021/">Improving Dialogue State Tracking with Turn-based Loss Function and Sequential Data Augmentation (EMNLP 2021)</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>While state-of-the-art Dialogue State Tracking (DST) models show promising results, all of them rely on a traditional cross-entropy loss function during the training process, which may not be optimal for improving the joint goal accuracy. Although several approaches recently propose augmenting the training set by copying user utterances and replacing the real slot values with other possible or even similar values, they are not effective at improving the performance of existing DST models. To address these challenges, we propose a Turn-based Loss Function (TLF) that penalises the model if it inaccurately predicts a slot value at the early turns more so than in later turns in order to improve joint goal accuracy. We also propose a simple but effective Sequential Data Augmentation (SDA) algorithm to generate more complex user utterances and system responses to effectively train existing DST models. Experimental results on two standard DST benchmark collections demonstrate that our proposed TLF and SDA techniques significantly improve the effectiveness of the state-of-the-art DST model by approximately 7-8% relative reduction in error and achieves a new state-of-the-art joint goal accuracy with 59.50 and 54.90 on MultiWOZ2. 1 and MultiWOZ2. 2, respectively.</p>
<p>The post <a href="https://edgar.meij.pro/improving-dialogue-state-tracking-turn-based-loss-function-sequential-data-augmentation-emnlp-2021/">Improving Dialogue State Tracking with Turn-based Loss Function and Sequential Data Augmentation (EMNLP 2021)</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://edgar.meij.pro/improving-dialogue-state-tracking-turn-based-loss-function-sequential-data-augmentation-emnlp-2021/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>News Article Retrieval in Context for Event-centric Narrative Creation</title>
		<link>https://edgar.meij.pro/news-article-retrieval-in-contextfor-event-centric-narrative-creation/</link>
					<comments>https://edgar.meij.pro/news-article-retrieval-in-contextfor-event-centric-narrative-creation/#respond</comments>
		
		<dc:creator><![CDATA[Edgar Meij]]></dc:creator>
		<pubDate>Sun, 11 Jul 2021 16:47:00 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Conference Papers]]></category>
		<category><![CDATA[Publications]]></category>
		<guid isPermaLink="false">https://edgar.meij.pro/?p=1475</guid>

					<description><![CDATA[<p>Writers such as journalists often use automatic tools to find relevant content to include in their narratives. In this paper, we focus on supporting writers in the news domain to develop event-centric narratives. Given an incomplete narrative that specifies a main event and a context, we aim to retrieve news&#8230;</p>
<p>The post <a href="https://edgar.meij.pro/news-article-retrieval-in-contextfor-event-centric-narrative-creation/">News Article Retrieval in Context for Event-centric Narrative Creation</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></description>
										<content:encoded><![CDATA[


<p>Writers such as journalists often use automatic tools to find relevant content to include in their narratives. In this paper, we focus on supporting writers in the news domain to develop event-centric narratives. Given an incomplete narrative that specifies a main event and a context, we aim to retrieve news articles that discuss relevant events that would enable the continuation of the narrative. We formally define this task and propose a retrieval dataset construction procedure that relies on existing news articles to simulate incomplete narratives and relevant articles. Experiments on two datasets derived from this procedure show that state-of-the-art lexical and semantic rankers are not sufficient for this task. We show that combining those with a ranker that ranks articles by reverse chronological order outperforms those rankers alone. We also perform an in-depth quantitative and qualitative analysis of the results that sheds light on the characteristics of this task.</p>



<p>See <a href="https://doi.org/10.1145/3471158.3472247">https://doi.org/10.1145/3471158.3472247</a> for more details.</p>
<p>The post <a href="https://edgar.meij.pro/news-article-retrieval-in-contextfor-event-centric-narrative-creation/">News Article Retrieval in Context for Event-centric Narrative Creation</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://edgar.meij.pro/news-article-retrieval-in-contextfor-event-centric-narrative-creation/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Contextualizing Trending Entities in News Stories</title>
		<link>https://edgar.meij.pro/contextualizing-trending-entities-in-news-stories/</link>
					<comments>https://edgar.meij.pro/contextualizing-trending-entities-in-news-stories/#respond</comments>
		
		<dc:creator><![CDATA[Edgar Meij]]></dc:creator>
		<pubDate>Mon, 01 Mar 2021 10:39:00 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Conference Papers]]></category>
		<category><![CDATA[Publications]]></category>
		<category><![CDATA[Information retrieval]]></category>
		<category><![CDATA[Learning to rank]]></category>
		<category><![CDATA[trend detection]]></category>
		<guid isPermaLink="false">https://edgar.meij.pro/?p=1438</guid>

					<description><![CDATA[<p>Trends are those keywords, phrases, or names that are mentioned most often on social media or in news in a particular timeframe.They are an effective way for human news readers to both discover and stay focused on the most relevant information of the day. In this work, we consider trends&#8230;</p>
<p>The post <a href="https://edgar.meij.pro/contextualizing-trending-entities-in-news-stories/">Contextualizing Trending Entities in News Stories</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>Trends are those keywords, phrases, or names that are mentioned most often on social media or in news in a particular timeframe.They are an effective way for human news readers to both discover and stay focused on the most relevant information of the day. In this work, we consider trends that correspond to an entity in a knowledge graph and introduce the new and as-yet unexplored task of identifying other entities that may help explain the &#8220;why&#8221; an entity is trending. We refer to these retrieved entities as contextual entities. Some of them are more important than others in the context of the trending entity and we thus determine a ranking of entities according to how useful they are in contextualizing the trend. We propose two solutions for ranking contextual entities. The first one is fully unsupervised and based on Personalized PageRank, calculated over a trending entity-specific graph of other entities where the edges encode a notion of directional similarity based on embedded background knowledge. Our second method is based on learning to rank and combines the intuitions behind the unsupervised model with signals derived from hand-crafted features in a supervised setting. We compare our models on this novel task by using a new, purpose-built test collection created using crowdsourcing. Our methods improve over the strongest baseline in terms of Precision at 1 by 7% (unsupervised) and 13% (supervised). We find that the salience of a contextual entity and how coherent it is with respect to the news story are strong indicators of relevance in both unsupervised and supervised settings.</p>



<p>See <a href="https://doi.org/10.1145/3437963.3441765">https://doi.org/10.1145/3437963.3441765</a>.</p>
<p>The post <a href="https://edgar.meij.pro/contextualizing-trending-entities-in-news-stories/">Contextualizing Trending Entities in News Stories</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://edgar.meij.pro/contextualizing-trending-entities-in-news-stories/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Report on the first workshop on bias in automatic knowledge graph construction at AKBC 2020</title>
		<link>https://edgar.meij.pro/report-workshop-bias-automatic-knowledge-graph-construction-akbc-2020/</link>
					<comments>https://edgar.meij.pro/report-workshop-bias-automatic-knowledge-graph-construction-akbc-2020/#respond</comments>
		
		<dc:creator><![CDATA[Edgar Meij]]></dc:creator>
		<pubDate>Tue, 08 Dec 2020 09:38:47 +0000</pubDate>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Journal Papers]]></category>
		<category><![CDATA[Publications]]></category>
		<guid isPermaLink="false">https://edgar.meij.pro/?p=1497</guid>

					<description><![CDATA[<p>We report on the First Workshop on Bias in Automatic Knowledge Graph Construction (KG-BIAS), which was co-located with the Automated Knowledge Base Construction (AKBC) 2020 conference. Identifying and possibly remediating any sort of bias in knowledge graphs, or in the methods used to construct or query them, has clear implications&#8230;</p>
<p>The post <a href="https://edgar.meij.pro/report-workshop-bias-automatic-knowledge-graph-construction-akbc-2020/">Report on the first workshop on bias in automatic knowledge graph construction at AKBC 2020</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>We report on the First Workshop on Bias in Automatic Knowledge Graph Construction (KG-BIAS), which was co-located with the Automated Knowledge Base Construction (AKBC) 2020 conference. Identifying and possibly remediating any sort of bias in knowledge graphs, or in the methods used to construct or query them, has clear implications for downstream systems accessing and using the information in such graphs. However, this topic remains relatively unstudied, so our main aim for organizing this workshop was to bring together a group of people from a variety of backgrounds with an interest in the topic, in order to arrive at a shared definition and roadmap for the future. Through a program that included two keynotes, an invited paper, three peer-reviewed full papers, and a plenary discussion, we have made initial inroads towards a common understanding and shared research agenda for this timely and important topic.</p>
<p>The post <a href="https://edgar.meij.pro/report-workshop-bias-automatic-knowledge-graph-construction-akbc-2020/">Report on the first workshop on bias in automatic knowledge graph construction at AKBC 2020</a> appeared first on <a href="https://edgar.meij.pro">Edgar Meij</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://edgar.meij.pro/report-workshop-bias-automatic-knowledge-graph-construction-akbc-2020/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
