Paper Digest: CIKM 2020 Highlights
The ACM Conference on Information and Knowledge Management (CIKM) is an annual computer science research conference dedicated to information management and knowledge management. In 2020, it is to be held virtually due to covid-19 pandemic.
To help the community quickly catch up on the work presented in this conference, Paper Digest Team processed all accepted papers, and generated one highlight sentence (typically the main topic) for each paper. Readers are encouraged to read these machine generated highlights / summaries to quickly get the main idea of each paper.
If you do not want to miss any interesting academic paper, you are welcome to sign up our free daily paper digest service to get updates on new papers published in your area every day. You are also welcome to follow us on Twitter and Linkedin to get updated with new conference digests.
Paper Digest Team
team@paperdigest.org
TABLE 1: Paper Digest: CIKM 2020 Highlights
Title | Authors | Highlight | |
---|---|---|---|
1 | Ensemble Block Co-clustering: A Unified Framework for Text Data | Séverine Affeldt; Lazhar Labiod; Mohamed Nadif; | In this paper, we propose a unified framework for Ensemble Block Co-clustering (EBCO), which aims to fuse multiple basic co-clusterings into a consensus structured affinity matrix. |
2 | Learning to Personalize for Web Search Sessions | Saad Aloteibi; Stephen Clark; | In this paper, we formulate session search as a personalization task under the framework of learning to rank. |
3 | Optimization of Answer Set Programs for Consistent Query Answering by Means of First-Order Rewriting | Aziz Amezian El Khalfioui; Jonathan Joertz; Dorian Labeeuw; Gaëtan Staquet; Jef Wijsen; | Optimization of Answer Set Programs for Consistent Query Answering by Means of First-Order Rewriting |
4 | Spectral Relaxations and Fair Densest Subgraphs | Aris Anagnostopoulos; Luca Becchetti; Adriano Fazzone; Cristina Menghini; Chris Schwiegelshohn; | In this paper, we address the problem of identifying a densest subgraph, while ensuring that none of one binary protected attribute is disparately impacted. |
5 | The Impact of Negative Triple Generation Strategies and Anomalies on Knowledge Graph Completion | Iti Bansal; Sudhanshu Tiwari; Carlos R. Rivero; | In this paper, we analyze the impact of negative triple generation during both training and testing on translation-based completion models. |
6 | tdGraphEmbed: Temporal Dynamic Graph-Level Embedding | Moran Beladev; Lior Rokach; Gilad Katz; Ido Guy; Kira Radinsky; | In this study, we present tdGraphEmbed, a novel temporal graph-level embedding approach that extend the random-walk based node embedding methods to globally embed both the nodes of the graph and its representation at each time step, thus creating representation of the entire graph at each step. |
7 | Learning to Match Jobs with Resumes from Sparse Interaction Data using Multi-View Co-Teaching Network | Shuqing Bian; Xu Chen; Wayne Xin Zhao; Kun Zhou; Yupeng Hou; Yang Song; Tao Zhang; Ji-Rong Wen; | To alleviate these problems, in this paper, we propose a novel multi-view co-teaching network from sparse interaction data for job-resume matching. |
8 | Incremental and Parallel Computation of Structural Graph Summaries for Evolving Graphs | Till Blume; David Richerby; Ansgar Scherp; | Existing graph summarization algorithms are tailored to specific graph summary models, only support one-time batch computation, are designed and implemented for a specific task, or evaluated using static graphs. Our novel, incremental, parallel algorithm addresses all these shortcomings. |
9 | Do People and Neural Nets Pay Attention to the Same Words: Studying Eye-tracking Data for Non-factoid QA Evaluation | Valeria Bolotova; Vladislav Blinov; Yukun Zheng; W. Bruce Croft; Falk Scholer; Mark Sanderson; | Similarity was found, consequently, we propose a method to exploit the BERT attention map to generate suggestions that simulate eye gaze during user evaluation. |
10 | Fast and Scalable Outlier Detection with Sorted Hypercubes | Eugênio F. Cabral; Robson L.F. Cordeiro; | This paper proposes HySortOD — a novel algorithm that uses an efficient hypercube-ordering-and-searching strategy for fast outlier detection. |
11 | SenticNet 6: Ensemble Application of Symbolic and Subsymbolic AI for Sentiment Analysis | Erik Cambria; Yang Li; Frank Z. Xing; Soujanya Poria; Kenneth Kwok; | In this work, we integrate top-down and bottom-up learning via an ensemble of symbolic and subsymbolic AI tools, which we apply to the interesting problem of polarity detection from text. |
12 | Laconic Image Classification: Human vs. Machine Performance | Javier Carrasco; Aidan Hogan; Jorge Pérez; | We propose laconic classification as a novel way to understand and compare the performance of diverse image classifiers. |
13 | Retrievability based Document Selection for Relevance Feedback with Automatically Generated Query Variants | Anirban Chakraborty; Debasis Ganguly; Owen Conlan; | To mitigate the problem of over-dependence of a pseudo-relevance feedback algorithm on the top-M document set, we make use of a set of equivalence classes of queries rather than one single query. |
14 | Learning Graph-Based Geographical Latent Representation for Point-of-Interest Recommendation | Buru Chang; Gwanghoon Jang; Seoyoon Kim; Jaewoo Kang; | In this paper, we propose a new graph-based geographical latent representation model (GGLR) which can capture highly non-linear geographical influences from complex user-POI networks. |
15 | Continuous-Time Dynamic Graph Learning via Neural Interaction Processes | Xiaofu Chang; Xuqin Liu; Jianfeng Wen; Shuang Li; Yanming Fang; Le Song; Yuan Qi; | To achieve this, we propose a principled graph-neural-based approach to learn continuous-time dynamic embeddings. |
16 | TGCN: Tag Graph Convolutional Network for Tag-Aware Recommendation | Bo Chen; Wei Guo; Ruiming Tang; Xin Xin; Yue Ding; Xiuqiang He; Dong Wang; | In this paper, we propose a novel tag-aware recommendation model named Tag Graph Convolutional Network (TGCN), which leverages the contextual semantics of multi-hop neighbors in the user-tag-item graph to alleviate the above issues. |
17 | An Adaptive Embedding Framework for Heterogeneous Information Networks | Daoyuan Chen; Yaliang Li; Bolin Ding; Ying Shen; | In this paper, we propose a novel adaptive embedding framework, which integrates these two kinds of methods to preserve both topological information and relational information. |
18 | Improving End-to-End Sequential Recommendations with Intent-aware Diversification | Wanyu Chen; Pengjie Ren; Fei Cai; Fei Sun; Maarten de Rijke; | We consider both accuracy and diversity by reformulating SRs as a list generation task, and propose an integrated approach with an end-to-end neural model, called intent-aware diversified sequential recommendation (IDSR). |
19 | Unsupervised Cyberbullying Detection via Time-Informed Gaussian Mixture Model | Lu Cheng; Kai Shu; Siqi Wu; Yasin N. Silva; Deborah L. Hall; Huan Liu; | To address these limitations, this work introduces a principled approach forunsupervised cyberbullying detection. |
20 | Product Quality Prediction with Convolutional Encoder-Decoder Architecture and Transfer Learning | Hao-Yi Chih; Yao-Chung Fan; Wen-Chih Peng; Hai-Yuan Kuo; | In this paper, we propose a deep convolutional model for predicting wafer fabrication quality in an intelligent integrated-circuit manufacturing application. |
21 | Matching in Selective and Balanced Representation Space for Treatment Effects Estimation | Zhixuan Chu; Stephen L. Rathbun; Sheng Li; | We propose a feature selection representation matching (FSRM) method based on deep representation learning and matching, which maps the original covariate space into a selective, nonlinear, and balanced representation space, and then conducts matching in the learned representation space. |
22 | TPR: Text-aware Preference Ranking for Recommender Systems | Yu-Neng Chuang; Chih-Ming Chen; Chuan-Ju Wang; Ming-Feng Tsai; Yuan Fang; Ee-Peng Lim; | In this work, we present a framework of text-aware preference ranking (TPR) for top- recommendation, in which we comprehensively model the joint association of user-item interaction and relations between items and associated text. |
23 | Offline Evaluation by Maximum Similarity to an Ideal Ranking | Charles L. A. Clarke; Mark D. Smucker; Alexandra Vtyurina; | Rather than propose extensions to these measures, we instead propose a radical simplification to replace them. |
24 | EPNet: Learning to Exit with Flexible Multi-Branch Network | Xin Dai; Xiangnan Kong; Tian Guo; | In this paper, we investigate the problem of designing a flexible multi-branch network and early-exiting policies that can adapt to the resource consumption to individual inference request without impacting the inference accuracy. |
25 | Cola-GNN: Cross-location Attention based Graph Neural Networks for Long-term ILI Prediction | Songgaojun Deng; Shusen Wang; Huzefa Rangwala; Lijing Wang; Yue Ning; | In this paper, we design a cross-location attention based graph neural network (Cola-GNN) for learning time series embeddings in long-term ILI predictions. |
26 | Opinion-aware Answer Generation for Review-driven Question Answering in E-Commerce | Yang Deng; Wenxuan Zhang; Wai Lam; | In this paper, we tackle opinion-aware answer generation by jointly learning answer generation and opinion mining tasks with a unified model. |
27 | UPON: User Profile Transferring across Networks | Mengting Diao; Zhongbao Zhang; Sen Su; Shuai Gao; Huafeng Cao; | Motivated by such observations, in this paper, we for the first time propose to study the user profiling problem from the transfer learning perspective. |
28 | Evaluating Stochastic Rankings with Expected Exposure | Fernando Diaz; Bhaskar Mitra; Michael D. Ekstrand; Asia J. Biega; Ben Carterette; | We introduce the concept of expected exposure as the average attention ranked items receive from users over repeated samples of the same query. |
29 | Towards Plausible Differentially Private ADMM Based Distributed Machine Learning | Jiahao Ding; Jingyi Wang; Guannan Liang; Jinbo Bi; Miao Pan; | In this paper, we address these concerns by proposing a novel (Improved) Plausible differentially Private ADMM algorithm, called PP-ADMM and IPP-ADMM. |
30 | Graph Prototypical Networks for Few-shot Learning on Attributed Networks | Kaize Ding; Jianling Wang; Jundong Li; Kai Shu; Chenghao Liu; Huan Liu; | To answer these questions, in this paper, we propose a graph meta-learning framework — Graph Prototypical Networks (GPN). |
31 | Neural Formatting for Spreadsheet Tables | Haoyu Dong Dong; Jinyu Wang; Zhouyu Fu; Shi Han; Dongmei Zhang; | In this paper, we propose CellGAN, a neural formatting model for learning and recommending formats of spreadsheet tables. |
32 | Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters | Yingtong Dou; Zhiwei Liu; Li Sun; Yutong Deng; Hao Peng; Philip S. Yu; | Alternatively, we propose a new model named CAmouflage-REsistant GNN (CARE-GNN), to enhance the GNN aggregation process with three unique modules against camouflages. |
33 | Towards Generalizable Deepfake Detection with Locality-aware AutoEncoder | Mengnan Du; Shiva Pentyala; Yuening Li; Xia Hu; | Motivated by the fine-grained nature and spatial locality characteristics of deepfakes, we propose Locality-Aware AutoEncoder (LAE) to bridge the generalization gap. |
34 | Quality-Aware Ranking of Arguments | Lorik Dumani; Ralf Schenkel; | We present a quality-aware ranking framework for arguments already extracted from texts and represented as argument graphs, considering multiple established quality measures. |
35 | RelSen: An Optimization-based Framework for Simultaneously Sensor Reliability Monitoring and Data Cleaning | Cheng Feng; Xiao Liang; Daniel Schneegass; PengWei Tian; | In this work, we propose RelSen, a novel optimization-based framework to address the two problems simultaneously via utilizing the mutual dependence between them. |
36 | Critically Examining the Claimed Value of Convolutions over User-Item Embedding Maps for Recommender Systems | Maurizio Ferrari Dacrema; Federico Parroni; Paolo Cremonesi; Dietmar Jannach; | In this work, we show through analytical considerations and empirical evaluations that the claimed gains reported in the literature cannot be attributed to the ability of CNNs to model embedding correlations, as argued in the original papers. |
37 | Query-to-Session Matching: Do NOT Forget History and Future during Response Selection for Multi-Turn Dialogue Systems | Zhenxin Fu; Shaobo Cui; Feng Ji; Ji Zhang; Haiqing Chen; Dongyan Zhao; Rui Yan; | Inspired by such motivation, we propose a query-to-session matching (QSM) framework to make full use of the session information: matching the query with the candidate session instead of the response only. |
38 | Learning from Textual Data in Database Systems | Michael Günther; Philipp Oehme; Maik Thiele; Wolfgang Lehner; | We defined relation retrofitting as an optimization problem, present an efficient algorithm solving it, and investigate the influence of various hyperparameters. |
39 | Rotate3D: Representing Relations as Rotations in Three-Dimensional Space for Knowledge Graph Embedding | Chang Gao; Chengjie Sun; Lili Shan; Lei Lin; Mingjiang Wang; | To address this issue, we propose a new model called Rotate3D, which maps entities to the three-dimensional space and defines relations as rotations from head entities to tail entities. |
40 | Set-Sequence-Graph: A Multi-View Approach Towards Exploiting Reviews for Recommendation | Jingyue Gao; Yang Lin; Yasha Wang; Xiting Wang; Zhao Yang; Yuanduo He; Xu Chu; | To overcome these limitations, we propose a multi-view approach named Set-Sequence-Graph (SSG), to augment existing single-view (i.e., view of set) methods by introducing two additional views of exploiting reviews: sequence and graph. |
41 | How and Why is An Answer (Still) Correct? Maintaining Provenance in Dynamic Knowledge Graphs | Garima Gaur; Arnab Bhattacharya; Srikanta Bedathur; | Addressing this issue, we present a framework HUKA that uses provenance polynomials for tracking the derivation of query results over knowledge graphs by encoding the edges involved in generating the answer. |
42 | MICK: A Meta-Learning Framework for Few-shot Relation Classification with Small Training Data | Xiaoqing Geng; Xiwen Chen; Kenny Q. Zhu; Libin Shen; Yinggong Zhao; | In this paper, we tackle an even harder problem by further limiting the amount of data available at training time. |
43 | Knowledge Graph Embedding Preserving Soft Logical Regularity | Shu Guo; Lin Li; Zhen Hui; Lingshuai Meng; Bingnan Ma; Wei Liu; Lihong Wang; Haibin Zhai; Hong Zhang; | This paper proposes a highly scalable and effective method for preserving soft logical regularities by imposing soft rule constraints on relation representations. |
44 | GraSeq: Graph and Sequence Fusion Learning for Molecular Property Prediction | Zhichun Guo; Wenhao Yu; Chuxu Zhang; Meng Jiang; Nitesh V. Chawla; | In this paper, we propose GraSeq, a joint graph and sequence representation learning model for molecular property prediction. |
45 | Modelling User Behavior Dynamics with Embeddings | Lei Han; Alessandro Checco; Djellel Difallah; Gianluca Demartini; Shazia Sadiq; | In this paper, we present a user behavior model built using behavior embeddings to compare behaviors and their change over time. |
46 | Genetic Meta-Structure Search for Recommendation on Heterogeneous Information Network | Zhenyu Han; Fengli Xu; Jinghan Shi; Yu Shang; Haorui Ma; Pan Hui; Yong Li; | To address these challenges, we propose Genetic Meta-Structure Search (GEMS) to automatically optimize meta-structure designs for recommendation on HINs. |
47 | Ranking Enhanced Dialogue Generation | Changying Hao; Liang Pang; Yanyan Lan; Fei Sun; Jiafeng Guo; Xueqi Cheng; | To tackle this problem, we propose a Ranking Enhanced Dialogue generation framework in this paper. |
48 | Privacy-Preserving Classification with Secret Vector Machines | Valentin Hartmann; Konark Modi; Josep M. Pujol; Robert West; | We address this issue by proposing a novel way of training a supervised classifier in a distributed setting akin to the recently proposed federated learning paradigm, but under the stricter privacy requirement that the server that trains the model is assumed to be untrusted and potentially malicious. |
49 | Learning to Selectively Update State Neurons in Recurrent Networks | Thomas Hartvigsen; Cansu Sen; Xiangnan Kong; Elke Rundensteiner; | To overcome this, we instead design the first fully-learned approach, SA-RNN, that augments any RNN by predicting discrete update patterns at the fine granularity of individual hidden state neurons. |
50 | Hypergraph Random Walks, Laplacians, and Clustering | Koby Hayashi; Sinan G. Aksoy; Cheong Hee Park; Haesun Park; | We propose a flexible framework for clustering hypergraph-structured data based on recently proposed random walks utilizing edge-dependent vertex weights. |
51 | VN Network: Embedding Newly Emerging Entities with Virtual Neighbors | Yongquan He; Zhihan Wang; Peng Zhang; Zhaopeng Tu; Zhaochun Ren; | In this paper, we propose a novel framework, namely Virtual Neighbor (VN) network, to address three key challenges. |
52 | WMEgo: Willingness Maximization for Ego Network Data Extraction in Online Social Networks | Bay-Yuan Hsu; Chih-Ya Shen; Ming-Yi Chang; | Therefore, in this paper, we make our first attempt to address this issue by proposing a new research problem, named Willingness Maximization for Ego Network Extraction in Online Social Networks (WMEgo), to identify a set of ego networks from the OSN such that the willingness of the users to contribute their data is maximized. |
53 | Learning to Detect Relevant Contexts and Knowledge for Response Selection in Retrieval-based Dialogue Systems | Kai Hua; Zhiyuan Feng; Chongyang Tao; Rui Yan; Lu Zhang; | To address this problem, we propose a multi-turn Response Selection Model that can Detect the relevant parts of the Context and Knowledge collection (RSM-DCK). |
54 | Image Captioning with Internal and External Knowledge | Feicheng Huang; Zhixin Li; Shengjia Chen; Canlong Zhang; Huifang Ma; | In this paper, we make our efforts to reason about more accurate and meaningful captions. |
55 | GNNVis: Visualize Large-Scale Data by Learning a Graph Neural Network Representation | Yajun Huang; Jingbin Zhang; Yiyang Yang; Zhiguo Gong; Zhifeng Hao; | In this work, we study the parametric (supervised) model which is capable to learn a mapping between high-dimensional data space Rd and low-dimensional latent space Rs with similarity structure in Rd preserved where s l d. |
56 | Predicting Economic Growth by Region Embedding: A Multigraph Convolutional Network Approach | Bo Hui; Da Yan; Wei-Shinn Ku; Wenlu Wang; | By effectively utilizing open data on the Internet such as government data (e.g., from US Census Bureau) and third-party data for supervised learning, we propose to first construct a multigraph that captures the various relationships between regions such as direct flight connections and shared school districts, and then learn region embeddings using a novel graph convolutional network architecture. |
57 | Sequential Recommender via Time-aware Attentive Memory Network | Wendi Ji; Keqiang Wang; Xiaoling Wang; Tingwei Chen; Alexandra Cristea; | In this paper, we propose a temporal gating methodology to improve attention mechanism and recurrent units, so that temporal information can be considered in both information filtering and state transition. |
58 | MARU: Meta-context Aware Random Walks for Heterogeneous Network Representation Learning | Jyun-Yu Jiang; Zeyu Li; Chelsea J.-T. Ju; Wei Wang; | In this paper, we propose Meta-context Aware Random Walks (MARU) to overcome these challenges, thereby learning richer and more unbiased representations for heterogeneous networks. |
59 | Partial Relationship Aware Influence Diffusion via a Multi-channel Encoding Scheme for Social Recommendation | Bo Jin; Ke Cheng; Liang Zhang; Yanjie Fu; Minghao Yin; Lu Jiang; | To this end, we propose a partial relationship aware influence diffusion structure via a computationally efficient multi-channel encoding scheme. |
60 | Social Factors in Closed-Network Content Consumption | Parisa Kaghazgaran; Maarten Bos; Leonardo Neves; Neil Shah; | We propose models for patterns in users’ time-spending behaviors across friends, and observe that viewers preferentially and consistently spend more time on content from certain friends, even without considering any explicit notion of intrinsic content value. |
61 | DE-RRD: A Knowledge Distillation Framework for Recommender System | SeongKu Kang; Junyoung Hwang; Wonbin Kweon; Hwanjo Yu; | In this paper, we propose a novel knowledge distillation framework for recommender system, called DE-RRD, which enables the student model to learn from the latent knowledge encoded in the teacher model as well as from the teacher’s predictions. |
62 | Collective Embedding with Feature Importance: A Unified Approach for Spatiotemporal Network Embedding | Dakshak Keerthi Chandra; Pengyang Wang; Jennifer Leopold; Yanjie Fu; | In this paper, we propose a collective embedding framework that leverages the use of auto-encoders and Laplacian score to learn effective embeddings of spatiotemporal networks of urban communities. |
63 | AutoFeature: Searching for Feature Interactions and Their Architectures for Click-through Rate Prediction | Farhan Khawar; Xu Hang; Ruiming Tang; Bin Liu; Zhenguo Li; Xiuqiang He; | To address these issues, we propose a neural architecture search based approach called AutoFeature that automatically finds essential feature interactions and selects an appropriate structure to model each of these interactions. |
64 | Selecting Influential Features by a Learnable Content-Aware Linear Threshold Model | Ansh Khurana; Alvis Logins; Panagiotis Karras; | We propose a method that learns the parameters of the CALT model and adapt the SimPath diffusion estimation method to build a heuristic for the influential feature selection problem. |
65 | Describing and Predicting Online Items with Reshare Cascades via Dual Mixture Self-exciting Processes | Quyu Kong; Marian-Andrei Rizoiu; Lexing Xie; | This work addresses these shortcomings by proposing dual mixture self-exciting processes to jointly learn from groups of cascades. |
66 | Extracting N-ary Facts from Wikipedia Table Clusters | Benno Kruit; Peter Boncz; Jacopo Urbani; | We propose a novel knowledge extraction technique that tackles these problems. |
67 | Live Multi-Streaming and Donation Recommendations via Coupled Donation-Response Tensor Factorization | Hsu-Chao Lai; Jui-Yi Tsai; Hong-Han Shuai; Jiun-Long Huang; Wang-Chien Lee; De-Nian Yang; | In this paper, we introduce Multi-Stream Party (MSP) and formulate a new multi-streaming recommendation problem, called Donation and MSP Recommendation (DAMRec). |
68 | MERL: Multi-View Edge Representation Learning in Social Networks | Yi-Yu Lai; Jennifer Neville; | In this paper, we propose a new methodology, MERL, that (1) captures asymmetry in multiple views by learning well-defined edge representations that are responsive to the difference between the source and destination node roles, and (2) incorporates textual communications to identify multiple source of social signals (e.g. strength and affinity) that moderate the impact of different views between users. |
69 | Towards Temporal Knowledge Graph Embeddings with Arbitrary Time Precision | Julien Leblay; Melisachew Wudage Chekol; Xin Liu; | In this work, we introduce and evaluate an approach that gracefully adjusts to time validity of virtually any granularity. |
70 | News Recommendation with Topic-Enriched Knowledge Graphs | Dongho Lee; Byungkook Oh; Seungmin Seo; Kyong-Ho Lee; | To cope with the above problem, in this paper, we propose Topic-Enriched Knowledge Graph Recommendation System(TEKGR). |
71 | Cross-sentence N-ary Relation Extraction using Entity Link and Discourse Relation | Sanghak Lee; Seungmin Seo; Byungkook Oh; Kyong-Ho Lee; Donghoon Shin; Yeonsoo Lee; | This paper presents an efficient method of extracting n-ary relations from multiple sentences which is called Entity-path and Discourse relation-centric Relation Extractor (EDCRE). |
72 | Knowledge Adaption for Demand Prediction based on Multi-task Memory Neural Network | Can Li; Lei Bai; Wei Liu; Lina Yao; S. Travis Waller; | As such, we propose to enhance the demand prediction of station-sparse modes with the data from station-intensive mode and design a Memory-Augmented Multi-task Re current Network (MATURE) to derive the transferable demand patterns from each mode and boost the prediction of station-sparse modes through adapting the relevant patterns from the station-intensive mode. |
73 | Learning with Noisy Partial Labels by Simultaneously Leveraging Global and Local Consistencies | Changchun Li; Ximing Li; Jihong Ouyang; | To alleviate this problem, we propose a novel PL method, namely PArtial label learNing by simultaneously leveraging GlObal and Local consIsteNcies (Pangolin). |
74 | Knowledge-Enhanced Personalized Review Generation with Capsule Graph Neural Network | Junyi Li; Siqing Li; Wayne Xin Zhao; Gaole He; Zhicheng Wei; Nicholas Jing Yuan; Ji-Rong Wen; | To address the above issues, we propose a novel knowledgeenhanced PRG model based on capsule graph neural network (CapsGNN). |
75 | Seed-free Graph De-anonymiztiation with Adversarial Learning | Kaiyang Li; Guoming Lu; Guangchun Luo; Zhipeng Cai; | In this paper, a seed-free graph de-anonymization method is proposed, where a deep neural network is adopted to learn features and an adversarial framework is employed for node matching. |
76 | Generate Neural Template Explanations for Recommendation | Lei Li; Yongfeng Zhang; Li Chen; | In an attempt to benefit both sentence expressiveness and quality, we propose a Neural Template (NETE) explanation generation framework, which brings the best of both worlds by learning sentence templates from data and generating template-controlled sentences that comment about specific features. |
77 | A Topic and Concept Integrated Model for Thread Recommendation in Online Health Communities | Mingda Li; Weiting Gao; Yi Chen; | To help users find relevant information easily, we present CLIR, an effective system for recommending relevant discussion threads to users in OHCs. |
78 | Trapping Malicious Crawlers in Social Networks | Shiju Li; Chul-Ho Lee; Do Young Eun; | In this paper, we study a problem of trapping malicious web crawlers in social networks to minimize the attacks from crawlers with malicious intents to steal personal/private information. |
79 | Deep Time-Aware Item Evolution Network for Click-Through Rate Prediction | Xiang Li; Chao Wang; Bin Tong; Jiwei Tan; Xiaoyi Zeng; Tao Zhuang; | In this work, we propose a CTR prediction model TIEN based on the time-aware item behavior. |
80 | Learning Better Representations for Neural Information Retrieval with Graph Information | Xiangsheng Li; Maarten de Rijke; Yiqun Liu; Jiaxin Mao; Weizhi Ma; Min Zhang; Shaoping Ma; | In this study, we aim to incorporate this rich information encoded in these two graphs into existing neural ranking models. |
81 | Cooperative Multi-Agent Reinforcement Learning in Express System | Yexin Li; Yu Zheng; Qiang Yang; | In this paper, we propose a reinforcement learning based framework to learn courier dispatching policies. |
82 | Meta-Learning for Neural Relation Classification with Distant Supervision | Zhenzhen Li; Jian-Yun Nie; Benyou Wang; Pan Du; Yuhan Zhang; Lixin Zou; Dongsheng Li; | In this paper, we propose a meta-learning based approach, which learns to reweight noisy training data under the guidance of reference data. |
83 | Aspect-invariant Sentiment Features Learning: Adversarial Multi-task Learning for Aspect-based Sentiment Analysis | Bin Liang; Rongdi Yin; Lin Gui; Jiachen Du; Yulan He; Ruifeng Xu; | Hence, in this paper, we propose an Adversarial Multi-task Learning framework to identify the aspect-invariant/dependent sentiment expressions without extra annotations. |
84 | Attributed Network Embedding based on Mutual Information Estimation | Xiaomin Liang; Daifeng Li; Andrew Madden; | We attempt to use local embeddings to represent local aspect information and propose InfomaxANE which encodes both global and local embeddings from the perspective of mutual information. |
85 | STP-UDGAT: Spatial-Temporal-Preference User Dimensional Graph Attention Network for Next POI Recommendation | Nicholas Lim; Bryan Hooi; See-Kiong Ng; Xueou Wang; Yong Liang Goh; Renrong Weng; Jagannadan Varadarajan; | In this work, we propose a Spatial-Temporal-Preference User Dimensional Graph Attention Network (STP-UDGAT), a novel explore-exploit model that concurrently exploits personalized user preferences and explores new POIs in global spatial-temporal-preference (STP) neighbourhoods, while allowing users to selectively learn from other users. |
86 | Attacking Recommender Systems with Augmented User Profiles | Chen Lin; Si Chen; Hui Li; Yanghua Xiao; Lianyun Li; Qian Yang; | In this paper, we study the shilling attack: a subsistent and profitable attack where an adversarial party injects a number of user profiles to promote or demote a target item. |
87 | Jointly Modeling Individual Student Behaviors and Social Influence for Prediction Tasks | Haobing Liu; Yanmin Zhu; Tianzi Zang; Jiadi Yu; Haibin Cai; | In this paper, we aim to propose a general deep neural network which can jointly model student heterogeneous daily behaviors generated from digital footprints and social influence to deal with prediction tasks. |
88 | Fusing Parallel Social Contexts within Flexible-Order Proximity for Microblog Topic Detection | Hongyu Liu; Ruifang He; Haocheng Wang; Bo Wang; | To this end, we propose a novel random walk based Parallel Social Contexts Fusion Topic Model (PCFTM) for weibo conversations. |
89 | Cross Domain Recommendation via Bi-directional Transfer Graph Collaborative Filtering Networks | Meng Liu; Jianjun Li; Guohui Li; Peng Pan; | In this paper, we propose a novel Bi-directional Transfer learning method for cross-domain recommendation by using Graph Collaborative Filtering network as the base model (BiTGCF). |
90 | Explainable Recommender Systems via Resolving Learning Representations | Ninghao Liu; Yong Ge; Li Li; Xia Hu; Rui Chen; Soo-Hyun Choi; | In this paper, after formally introducing the elements that are related to model explainability, we propose a novel explainable recommendation model through improving the transparency of the representation learning process. |
91 | Deep Spatio-Temporal Multiple Domain Fusion Network for Urban Anomalies Detection | Ruiqiang Liu; Shuai Zhao; Bo Cheng; Hao Yang; Haina Tang; Taoyu Li; | In this paper, we propose a novel end-to-end deep learning based framework, namely deep spatio-temporal multiple domain fusion network to collect the impacts of urban anomalies on multiple datasets and detect anomalies in each region of the city at next time interval in turn. |
92 | Structural Relationship Representation Learning with Graph Embedding for Personalized Product Search | Shang Liu; Wanli Gu; Gao Cong; Fuzheng Zhang; | In this work, we propose to consider structural relationship in users’ product search scenario with graph embedding by latent representation learning. |
93 | Personalized Re-ranking with Item Relationships for E-commerce | Weiwen Liu; Qing Liu; Ruiming Tang; Junyang Chen; Xiuqiang He; Pheng Ann Heng; | We develop a graph neural network based framework, IRGPR, to explicitly model transitive item relationships by recursively aggregating relational information from multi-hop neighborhoods. |
94 | An NVM SSD-Optimized Query Processing Framework | Xinyu Liu; Yu Pan; Yusen Li; Gang Wang; Xiaoguang Liu; | In this paper, we propose an NVM SSD-optimized query processing framework, aiming to address both the latency and bandwidth issues of using NVM in search engines. |
95 | Shapley Values and Meta-Explanations for Probabilistic Graphical Model Inference | Yifei Liu; Chao Chen; Yazheng Liu; Xi Zhang; Sihong Xie; | As brute-force computation of the Shapley values is challenging, we propose GraphShapley, an approximation algorithm that exploits the decomposability of Shapley values, the structure of MRFs, and the iterative nature of BP inference to speed up the computation. |
96 | Recommending Inferior Results: A General and Feature-Free Model for Spam Detection | Yuli Liu; | To further boost detection performance, we incorporate the pairwise ranking detection method and the widely used structure-based algorithm into an integrated framework. |
97 | Towards Locality-Aware Meta-Learning of Tail Node Embeddings on Networks | Zemin Liu; Wentao Zhang; Yuan Fang; Xinming Zhang; Steven C.H. Hoi; | To reduce overfitting in the personalization, we propose a locality-aware meta-learning framework, called meta-tail2vec, which learns to learn the regression model for the tail nodes at different localities. |
98 | Feature Fusion Based Subgraph Classification for Link Prediction | Zheyi Liu; Darong Lai; Chuanyou Li; Meng Wang; | In this study, we established the Subgraph Hierarchy Feature Fusion (SHFF) model for link prediction. |
99 | Fast Attributed Multiplex Heterogeneous Network Embedding | Zhijun Liu; Chao Huang; Yanwei Yu; Baode Fan; Junyu Dong; | In this work, we propose a Fast Attributed Multiplex heterogeneous network Embedding framework (FAME) for large-scale network data, by mapping the units from different modalities (i.e., network topological structures, various node features and relations) into the same latent space in a very efficient way. |
100 | Dynamic Representation Learning for Large-Scale Attributed Networks | Zhijun Liu; Chao Huang; Yanwei Yu; Peng Song; Baode Fan; Junyu Dong; | In this work, we address these challenges by developing the DRLAN-Dynamic Representation Learning framework for large-scale Attributed Networks. |
101 | Dual Head-wise Coattention Network for Machine Comprehension with Multiple-Choice Questions | Zhuang Liu; Kaiyu Huang; Degen Huang; Zhuang Liu; Jun Zhao; | In this paper, we present a novel architecture, Dual Head-wise Coattention network (called DHC), which is a simple and efficient attention neural network designed to perform multiple-choice MC task. |
102 | Spatiotemporal Adaptive Gated Graph Convolution Network for Urban Traffic Flow Forecasting | Bin Lu; Xiaoying Gan; Haiming Jin; Luoyi Fu; Haisong Zhang; | In this paper, we exploit spatiotemporal correlation of urban traffic flow and construct a dynamic weighted graph by seeking both spatial neighbors and semantic neighbors of road nodes. |
103 | Probabilistic Dynamic Non-negative Group Factor Model for Multi-source Text Mining | Chien Lu; Jaakko Peltonen; Jyrki Nummenmaa; Kalervo Järvelin; | We introduce a probabilistic, Gaussian Process-based, more inclusive NMF-based model which jointly analyzes nonnegative data such as text data word content from multiple sources in a temporal dynamic manner. |
104 | Hierarchical Active Learning with Overlapping Regions | Zhipeng Luo; Milos Hauskrecht; | To minimize the number and complexity of region-based queries, we propose and develop a hierarchical active learning solution that aims at incrementally building a concise hierarchy of regions. |
105 | Adaptive-Step Graph Meta-Learner for Few-Shot Graph Classification | Ning Ma; Jiajun Bu; Jieyu Yang; Zhen Zhang; Chengwei Yao; Zhi Yu; Sheng Zhou; Xifeng Yan; | More specifically, (1) we propose a novel framework consisting of a graph meta-learner, which uses GNNs based modules for fast adaptation on graph data, and a step controller for the robustness and generalization of meta-learner; (2) we provide quantitative analysis for the framework and give a graph-dependent upper bound of the generalization error based on our framework; (3) the extensive experiments on real-world datasets demonstrate that our framework gets state-of-the-art results on several few-shot graph classification tasks compared to baselines. |
106 | Feature Selection on Data Stream via Multi-Cluster Structure Preservation | Rui Ma; Yijie Wang; Li Cheng; | To address this issue, we propose an online unsupervised Feature Selection method via Multi-Cluster structure Preservation (FSMCP for short). |
107 | PSTIE: Time Information Enhanced Personalized Search | Zhengyi Ma; Zhicheng Dou; Guanyue Bian; Ji-Rong Wen; | In this paper, we propose PSTIE, a fine-grained Time Information Enhanced model to construct more accurate user interest representations for Personalized Search. |
108 | Examining the Additivity of Top-k Query Processing Innovations | Joel Mackenzie; Alistair Moffat; | We employ four query processing algorithms, four compression codecs, and all possible combinations of four distinct further optimizations, and compare the performance of the 256 resulting systems to determine when and how different optimizations interact. |
109 | Relational Reflection Entity Alignment | Xin Mao; Wenting Wang; Huimin Xu; Yuanbin Wu; Man Lan; | In this paper, we abstract existing entity alignment methods into a unified framework, Shape-Builder & Alignment, which not only successfully explains the above phenomena but also derives two key criteria for an ideal transformation operation. |
110 | CSNE: Conditional Signed Network Embedding | Alexandru Mara; Yoosof Mashayekhi; Jefrey Lijffijt; Tijl de Bie; | In this context, we introduce conditional signed network embedding (CSNE). |
111 | Learning to Distract: A Hierarchical Multi-Decoder Network for Automated Generation of Long Distractors for Multiple-Choice Questions for Reading Comprehension | Kaushal Kumar Maurya; Maunendra Sankar Desarkar; | In this paper, we propose a novel Hierarchical Multi-Decoder Network (HMD-Net) consisting of one encoder and three decoders, where each decoder generates a single distractor. |
112 | "Keep it Simple, Lazy" — MetaLazy: A New MetaStrategy for Lazy Text Classification | Luiz Felipe Mendes; Marcos Gonçalves; Washington Cunha; Leonardo Rocha; Thierson Couto-Rosa; Wellington Martins; | We propose a shift in the paradigm to do "more with less" by focusing, at maximum extent, just on the task at hand (e.g., classify a single test instance). |
113 | A Methodology Based on Deep Q-Learning/Genetic Algorithms for Optimizing COVID-19 Pandemic Government Actions | Luis Miralles-Pechuán; Fernando Jiménez; Hiram Ponce; L. Martínez-Villaseñor; | To optimize the best sequences of actions governments can take, we propose a methodology with two approaches, one based on Deep Q-Learning and another one based on Genetic Algorithms. |
114 | SWE2: SubWord Enriched and Significant Word Emphasized Framework for Hate Speech Detection | Guanyi Mou; Pengyi Ye; Kyumin Lee; | In this paper, we address the hate speech problem and propose a novel hate speech detection framework called SWE2, which only relies on the content of messages and automatically identifies hate speech. |
115 | Deep Generative Positive-Unlabeled Learning under Selection Bias | Byeonghu Na; Hyemi Kim; Kyungwoo Song; Weonyoung Joo; Yoon-Yeong Kim; Il-Chul Moon; | Therefore, we introduce the VAE-PU, which is a variant of variational autoencoders to separate two latent variables that generate either features or observation indicators. |
116 | FANG: Leveraging Social Context for Fake News Detection Using Graph Representation | Van-Hoang Nguyen; Kazunari Sugiyama; Preslav Nakov; Min-Yen Kan; | We propose Factual News Graph (FANG), a novel graphical social context representation and learning framework for fake news detection. |
117 | Uncovering Semantic Bias in Neural Network Models Using a Knowledge Graph | Andriy Nikolov; Mathieu d’Aquin; | In this paper we apply rule mining using knowledge graphs in combination with neural network explanation methods to uncover such systematic preferences of trained neural models and capture them in the form of conjunctive rules. |
118 | STP-TrellisNets: Spatial-Temporal Parallel TrellisNets for Metro Station Passenger Flow Prediction | Junjie Ou; Jiahui Sun; Yichen Zhu; Haiming Jin; Yijuan Liu; Fan Zhang; Jianqiang Huang; Xinbing Wang; | Thus, in this paper, we aim to address the problem of accurately predicting metro station passenger (MSP) flows. |
119 | Star Graph Neural Networks for Session-based Recommendation | Zhiqiang Pan; Fei Cai; Wanyu Chen; Honghui Chen; Maarten de Rijke; | We propose Star Graph Neural Networks with Highway Networks (SGNN-HN) for session-based recommendation. |
120 | RKT: Relation-Aware Self-Attention for Knowledge Tracing | Shalini Pandey; Jaideep Srivastava; | In this paper, we propose a novel Relation-aware self-attention model for Knowledge Tracing (RKT). |
121 | ST-GRAT: A Novel Spatio-temporal Graph Attention Networks for Accurately Forecasting Dynamically Changing Road Speed | Cheonbok Park; Chunggi Lee; Hyojin Bahng; Yunwon Tae; Seungmin Jin; Kihwan Kim; Sungahn Ko; Jaegul Choo; | This paper proposes a novel spatio-temporal graph attention (ST-GRAT) that effectively captures the spatio-temporal dynamics in road networks. |
122 | Minimal Edit-Based Diffs for Large Trees | Mateusz Pawlik; Nikolaus Augsten; | In this paper, we leverage the fact that the difference between two versions of a tree is typically much smaller than the overall tree size. |
123 | Efficient Detection of Data Dependency Violations | Eduardo H. M. Pena; Edson R. Lucas Filho; Eduardo C. de Almeida; Felix Naumann; | We describe its execution model, which operates on a compressed block of tuples at-a-time, and we present various algorithms that take advantage of the predicate form in the DCs to provide effective code patterns. |
124 | EnDeA: Ensemble based Decoupled Adversarial Learning for Identifying Infrastructure Damage during Disasters | Shalini Priya; Apoorva Upadhyaya; Manish Bhanu; Sourav Kumar Dandapat; Joydeep Chandra; | In this paper we introduce an Ensemble learning approach with a Decoupled Adversarial (EnDeA) model to classify infrastructure damage tweets in a target tweet dataset. |
125 | G-CREWE: Graph CompREssion With Embedding for Network Alignment | Kyle K. Qin; Flora D. Salim; Yongli Ren; Wei Shao; Mark Heimann; Danai Koutra; | In this paper, we propose a framework, called G-CREWE (Graph CompREssion With Embedding) to solve the network alignment problem. |
126 | Diversifying Search Results using Self-Attention Network | Xubo Qin; Zhicheng Dou; Ji-Rong Wen; | In this paper, we propose a new supervised diversification framework to address this issue. |
127 | Time-Efficient Geo-Obfuscation to Protect Worker Location Privacy over Road Networks in Spatial Crowdsourcing | Chenxi Qiu; Anna Squicciarini; Zhouzhao Li; Ce Pang; Li Yan; | In this paper, we tackle the SC worker location privacy problem over road networks. |
128 | Hierarchical Query Graph Generation for Complex Question Answering over Knowledge Graph | Yunqi Qiu; Kun Zhang; Yuanzhuo Wang; Xiaolong Jin; Long Bai; Saiping Guan; Xueqi Cheng; | This paper proposes a Director-Actor-Critic framework to overcome these challenges. |
129 | Robust Irregular Tensor Factorization and Completion for Temporal Health Data Analysis | Yifei Ren; Jian Lou; Li Xiong; Joyce C. Ho; | We propose REPAIR, a Robust tEmporal PARAFAC2 method for IRregular tensor factorization and completion method, to complete an irregular tensor and extract phenotypes in the presence of missing and erroneous values. |
130 | The COVID-19 Infodemic: Can the Crowd Judge Recent Misinformation Objectively? | Kevin Roitero; Michael Soprano; Beatrice Portelli; Damiano Spina; Vincenzo Della Mea; Giuseppe Serra; Stefano Mizzaro; Gianluca Demartini; | We specifically target statements related to the COVID-19 health emergency, that is still ongoing at the time of the study and has arguably caused an increase of the amount of misinformation that is spreading online (a phenomenon for which the term "infodemic" has been used). |
131 | ZSCRGAN: A GAN-based Expectation Maximization Model for Zero-Shot Retrieval of Images from Textual Descriptions | Anurag Roy; Vinay Kumar Verma; Kripabandhu Ghosh; Saptarshi Ghosh; | In this paper, we propose a novel GAN-based model for zero-shot text to image retrieval. |
132 | Characteristic Functions on Graphs: Birds of a Feather, from Statistical Descriptors to Parametric Models | Benedek Rozemberczki; Rik Sarkar; | In this paper, we propose a flexible notion of characteristic functions defined on graph vertices to describe the distribution of vertex features at multiple scales. |
133 | A GAN-based Framework for Modeling Hashtag Popularity Dynamics Using Assistive Information | Avirup Saha; Niloy Ganguly; | In this paper, we propose a marked TPP based on Generative Adversarial Networks (GANs) which can seamlessly incorporate the assistive information necessary to capture the above effects and successfully forecast distant popularity trends. |
134 | Index Obfuscation for Oblivious Document Retrieval in a Trusted Execution Environment | Jinjin Shao; Shiyu Ji; Alvin Oliver Glova; Yifan Qiao; Tao Yang; Tim Sherwood; | This paper proposes an efficiency-enhanced design that obfuscates the inverted index structure with posting bucketing and document ID masking, which aims to hide document-term association and avoid the access pattern leakage. |
135 | Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing | Wei Shen; Xiaonan He; Chuheng Zhang; Qiang Ni; Wanchun Dou; Yan Wang; | To deal with this issue, we formulate the participant selection problem as a reinforcement learning problem and propose to solve it with a novel method, which we call auxiliary-task based deep reinforcement learning (ADRL). |
136 | Neural Logic Reasoning | Shaoyun Shi; Hanxiong Chen; Weizhi Ma; Jiaxin Mao; Min Zhang; Yongfeng Zhang; | In this paper, we propose Logic-Integrated Neural Network (LINN) to integrate the power of deep learning and logic reasoning. |
137 | METEOR: Learning Memory and Time Efficient Representations from Multi-modal Data Streams | Amila Silva; Shanika Karunasekera; Christopher Leckie; Ling Luo; | These studies aim to understand higher-level contextual information (e.g., a Twitter message) by jointly learning embeddings for the lower-level semantic units in different modalities (e.g., text, user, and location of a Twitter message). |
138 | Carpe Diem, Seize the Samples Uncertain "at the Moment" for Adaptive Batch Selection | Hwanjun Song; Minseok Kim; Sundong Kim; Jae-Gil Lee; | In this paper, we propose a novel adaptive batch selection algorithm called Recency Bias that exploits the uncertain samples predicted inconsistently in recent iterations. |
139 | Continual Domain Adaptation for Machine Reading Comprehension | Lixin Su; Jiafeng Guo; Ruqing Zhang; Yixing Fan; Yanyan Lan; Xueqi Cheng; | To tackle the CDA task, we propose several BERT-based continual learning MRC models using either regularization-based methodology or dynamic-architecture paradigm. |
140 | Multi-modal Knowledge Graphs for Recommender Systems | Rui Sun; Xuezhi Cao; Yan Zhao; Junchen Wan; Kun Zhou; Fuzheng Zhang; Zhongyuan Wang; Kai Zheng; | In this paper, we propose Multi-modal Knowledge Graph Attention Network (MKGAT) to better enhance recommender systems by leveraging multi-modal knowledge. |
141 | Anomaly Subgraph Detection with Feature Transfer | Ying Sun; Wenjun Wang; Nannan Wu; Wei Yu; Xue Chen; | In this paper, to detect anomaly in a graph with insufficient anomalous features, we propose a pioneering approach ASD-FT (Anomaly Subgraph Detection with Feature Transfer) based on a strategy of anomalous feature transfers between different layers of a multilayer graph. |
142 | OHEA: Secure Data Aggregation in Wireless Sensor Networks against Untrusted Sensors | Li Tang; Haibo Hu; | This paper proposes Onion Homomorphic Encryption-based Aggregation (OHEA), where sources form groups with their dedicated encryption keys, a.k.a., the group keys. |
143 | Investigating and Mitigating Degree-Related Biases in Graph Convoltuional Networks | Xianfeng Tang; Huaxiu Yao; Yiwei Sun; Yiqi Wang; Jiliang Tang; Charu Aggarwal; Prasenjit Mitra; Suhang Wang; | In this paper, we analyze GCNs in regard to the node degree distribution. |
144 | QSAN: A Quantum-probability based Signed Attention Network for Explainable False Information Detection | Tian Tian; Yudong Liu; Xiaoyu Yang; Yuefei Lyu; Xi Zhang; Binxing Fang; | In this paper, we focus on how to learn from the post contents and related comments in social media to understand and detect the false information more effectively, with explainability. |
145 | Quaternion-Based Self-Attentive Long Short-term User Preference Encoding for Recommendation | Thanh Tran; Di You; Kyumin Lee; | In this paper, we fully utilize Quaternion space to model both user’s long-term and short-term preferences. |
146 | E-Commerce Dispute Resolution Prediction | David Tsurel; Michael Doron; Alexander Nus; Arnon Dagan; Ido Guy; Dafna Shahaf; | In this work we take a first step towards automatically assisting human agents in dispute resolution at scale. |
147 | When Inverse Propensity Scoring does not Work: Affine Corrections for Unbiased Learning to Rank | Ali Vardasbi; Harrie Oosterhuis; Maarten de Rijke; | Our main contribution is a new estimator based on affine corrections: it both reweights clicks and penalizes items displayed on ranks with high trust bias. |
148 | A Graph Matching Attack on Privacy-Preserving Record Linkage | Anushka Vidanage; Peter Christen; Thilina Ranbaduge; Rainer Schnell; | In this paper we present a novel attack on PPRL methods that exploits the approximate similarities calculated between encoded records. |
149 | Semi-Supervised Max-Sum Clustering | Konstantin Voevodski; | We present computational experiments to show that our framework is effective for clustering text data – we are able to find clusterings that are close to the queried clustering and have a good objective value. |
150 | Efficient Sampling Algorithms for Approximate Temporal Motif Counting | Jingjing Wang; Yanhao Wang; Wenjun Jiang; Yuchen Li; Kian-Lee Tan; | In this paper, we focus on approximate temporal motif counting via random sampling. |
151 | Streaming Graph Neural Networks via Continual Learning | Junshan Wang; Guojie Song; Yi Wu; Liang Wang; | In this paper, we propose a streaming GNN model based on continual learning so that the model is trained incrementally and up-to-date node representations can be obtained at each time step. |
152 | Soap: Soaking Capacity Optimization for Multi-Document Summarization | Kexiang Wang; Baobao Chang; Zhifang Sui; | In this paper, we consider the MDS task as an optimization problem with a novel measure named soaking capacity being the objective function. |
153 | Mining Infrequent High-Quality Phrases from Domain-Specific Corpora | Li Wang; Wei Zhu; Sihang Jiang; Sheng Zhang; Keqiang Wang; Yuan Ni; Guotong Xie; Yanghua Xiao; | In this paper, we focus on mining high-quality phrases from domain-specific corpora with special consideration of infrequent ones. |
154 | Graph Few-shot Learning with Attribute Matching | Ning Wang; Minnan Luo; Kaize Ding; Lingling Zhang; Jundong Li; Qinghua Zheng; | To tackle the aforementioned problem, we propose a novel graph meta-learning framework–Attribute Matching Meta-learning Graph Neural Networks (AMM-GNN). |
155 | Multi-task Adversarial Spatial-Temporal Networks for Crowd Flow Prediction | Senzhang Wang; Hao Miao; Hao Chen; Zhiqiu Huang; | In this paper, we study the novel problem of predicting the crowd flow and flow OD simultaneously, and propose a multi-task adversarial spatial-temporal network model entitled MT-ASTN to effectively address it. |
156 | Negative Confidence-Aware Weakly Supervised Binary Classification for Effective Review Helpfulness Classification | Xi Wang; Iadh Ounis; Craig Macdonald; | In this work, we propose a novel Negative Confidence-aware Weakly Supervised approach (NCWS), which customises a binary classification loss function by discriminating the unlabelled examples with different negative confidences during the classifier’s training. |
157 | Fast Graph Convolution Network Based Multi-label Image Recognition via Cross-modal Fusion | Yangtao Wang; Yanzhao Xie; Yu Liu; Ke Zhou; Xiaocui Li; | To overcome this shortcoming, in this paper, we introduce Multi-modal Factorized Bilinear pooling (MFB) which works as an efficient component to fuse cross-modal embeddings and propose F-GCN, a fast graph convolution network (GCN) based multi-label image recognition model. |
158 | Bringing Order to Network Embedding: A Relative Ranking based Approach | Yaojing Wang; Guosheng Pan; Yuan Yao; Hanghang Tong; Hongxia Yang; Feng Xu; Jian Lu; | In this paper, we propose to go beyond such pointwise approaches, and introduce the ranking-oriented design principle for network embedding. |
159 | Efficient Knowledge Graph Validation via Cross-Graph Representation Learning | Yaqing Wang; Fenglong Ma; Jing Gao; | Towards effective KG validation, we propose to leverage an external human-curated KG as auxiliary information source to help detect the errors in a target KG. |
160 | DisenHAN: Disentangled Heterogeneous Graph Attention Network for Recommendation | Yifan Wang; Suyao Tang; Yuntong Lei; Weiping Song; Sheng Wang; Ming Zhang; | In this paper, we propose a novel disentangled heterogeneous graph attention network DisenHAN for top-N recommendation, which learns disentangled user/item representations from different aspects in a heterogeneous information network. |
161 | Succinct Adaptive Manifold Transfer | Pengfei Wei; Yiping Ke; Zhiqiang Xu; Tze Yun Leong; | In this paper, we propose an effective and efficient Gaussian process (GP) modelling framework, mTGPmk, that can explicitly model domain relatedness and adaptively control the space as well as the strength of knowledge transfer. |
162 | Personalized Imputation on Wearable-Sensory Time Series via Knowledge Transfer | Xian Wu; Stephen Mattingly; Shayan Mirjafari; Chao Huang; Nitesh V. Chawla; | To address the limitation, this work develops a new imputation framework–Personalized Wearable-Sensory Time Series Imputation framework (PTSI) to provide a fully personalized treatment for time series imputation via effective knowledge transfer. |
163 | Providing Direct Answers in Search Results: A Study of User Behavior | Zhijing Wu; Mark Sanderson; B. Barla Cambazoglu; W. Bruce Croft; Falk Scholer; | To study the impact of providing direct answers in search results on user behavior, we conducted a controlled user study to analyze factors including reading time, eye-tracked attention, and the influence of the quality of answer module content. |
164 | CAFE: Coarse-to-Fine Neural Symbolic Reasoning for Explainable Recommendation | Yikun Xian; Zuohui Fu; Handong Zhao; Yingqiang Ge; Xu Chen; Qiaoying Huang; Shijie Geng; Zhou Qin; Gerard de Melo; S. Muthukrishnan; Yongfeng Zhang; | To this end, we propose a CoArse-to-FinE neural symbolic reasoning approach (CAFE). |
165 | OPHiForest: Order Preserving Hashing Based Isolation Forest for Robust and Scalable Anomaly Detection | Haolong Xiang; Zoran Salcic; Wanchun Dou; Xiaolong Xu; Lianyong Qi; Xuyun Zhang; | In this paper, we propose a novel anomaly detection method named OPHiForest with the use of the order preserving hashing based isolation forest. |
166 | Deep Graph Convolutional Networks for Incident-Driven Traffic Speed Prediction | Qinge Xie; Tiancheng Guo; Yang Chen; Yu Xiao; Xin Wang; Ben Y. Zhao; | In this work, we aim to make use of the information of incidents to achieve a better prediction of traffic speed. |
167 | Controllable Multi-Character Psychology-Oriented Story Generation | Feifei Xu; Xinpeng Wang; Yunpu Ma; Volker Tresp; Yuyi Wang; Shanlin Zhou; Haizhou Du; | In our work, we aim to design an emotional line for each character that considers multiple emotions common in psychological theories, with the goal of generating stories with richer emotional changes in the characters. |
168 | Schema2QA: High-Quality and Low-Cost Q&A Agents for the Structured Web | Silei Xu; Giovanni Campagna; Jian Li; Monica S. Lam; | This paper proposes Schema2QA, an open-source toolkit that can generate a Q&A system from a database schema augmented with a few annotations for each field. |
169 | E-commerce Recommendation with Weighted Expected Utility | Zhichao Xu; Yi Han; Yongfeng Zhang; Qingyao Ai; | In this paper, we interpret purchase utility as the satisfaction level a consumer gets from a product and propose a recommendation framework using EU to model consumers’ behavioral patterns. |
170 | NHP: Neural Hypergraph Link Prediction | Naganand Yadati; Vikram Nitin; Madhav Nimishakavi; Prateek Yadav; Anand Louis; Partha Talukdar; | We propose two variants of NHP — NHP-U and NHP-D — for link prediction over undirected and directed hypergraphs, respectively. |
171 | Fair Class Balancing: Enhancing Model Fairness without Observing Sensitive Attributes | Shen Yan; Hsien-te Kao; Emilio Ferrara; | In this paper, we first illustrate the extent to which common balancing techniques exacerbate unfairness in real-world data. |
172 | Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document Matching | Liu Yang; Mingyang Zhang; Cheng Li; Michael Bendersky; Marc Najork; | In this paper, we address the issue by proposing the Siamese Multi-depth Transformer-based Hierarchical (SMITH) Encoder for long-form document matching. |
173 | NagE: Non-Abelian Group Embedding for Knowledge Graphs | Tong Yang; Long Sha; Pengyu Hong; | Motivated by the theoretical analysis, we have proposed a group theory-based knowledge graph embedding framework, in which relations are embedded as group elements, and entities are represented by vectors in group action spaces. |
174 | LB-CGM: Latent Based Conditional Generative Model with Reliable Distribution Prediction | Yichen Yao; Guozheng Li; Yujie Chen; Rongqi Li; Yinzhi Zhou; Xiaodong Zhang; Haoyuan Hu; Yinghui Xu; | In this paper, we combine the advantages of both GANs (Generative Adversarial Nets) and VAEs (Variational Auto-Encoders), and introduce a latent-based conditional generative model (LB-CGM) to handle the distribution regression problems. |
175 | LSAN: Modeling Long-term Dependencies and Short-term Correlations with Hierarchical Attention for Risk Prediction | Muchao Ye; Junyu Luo; Cao Xiao; Fenglong Ma; | To minimize the effect caused by noise information of EHR data, in this paper, we propose a novel DNN for risk prediction termed as LSAN, which consists of a Hierarchical Attention Module (HAM) and a Temporal Aggregation Module (TAM). |
176 | Logic Enhanced Commonsense Inference with Chain Transformer | Chenxi Yuan; Chun Yuan; Yang Bai; Ziran Li; | To alleviate this issue, we propose Chain Transformer, a logic enhanced commonsense inference model that combines both direct and indirect inferences to construct a logical chain so as to reason in a more logically consistent way. |
177 | Exploring Missing Interactions: A Convolutional Generative Adversarial Network for Collaborative Filtering | Feng Yuan; Lina Yao; Boualem Benatallah; | In this work, we focus on the semantic-rich user-item interactions in a recommender system and propose a novel generative adversarial network (GAN) named Convolutional Generative Collaborative Filtering (Conv-GCF). |
178 | GeneraLight: Improving Environment Generalization of Traffic Signal Control via Meta Reinforcement Learning | Huichu Zhang; Chang Liu; Weinan Zhang; Guanjie Zheng; Yong Yu; | In this paper, we design a novel traffic flow generator based on Wasserstein generative adversarial network to generate sufficient diverse and quality traffic flows and use them to build proper training and testing environments. |
179 | TOMATO: A Topic-Wise Multi-Task Sparsity Model | Jason (Jiasheng) Zhang; Dongwon Lee; | Based on this observation, in this paper, we propose a generalized topic-wise multi-task architecture, to capture the within-topic task relationship, which can be combined with any existing MTL designs. |
180 | More Than One: A Cluster-Prototype Matching Framework for Zero-Shot Learning | Jing Zhang; YangLi-a o Geng; Qingyong Li; Chuan Shi; | Inspired by this, we introduce a novel cluster-prototype matching (CPM) strategy and propose a ZSL framework based on CPM. |
181 | A Feature-Importance-Aware and Robust Aggregator for GCN | Li Zhang; Haiping Lu; | Under this framework, we propose a new model called LA-GCNMask consisting of a new aggregator function,mask aggregator. |
182 | Query Understanding via Intent Description Generation | Ruqing Zhang; Jiafeng Guo; Yixing Fan; Yanyan Lan; Xueqi Cheng; | In this paper, therefore, we propose a novel Query-to-Intent-Description (Q2ID) task for query understanding. |
183 | Generating Categories for Sets of Entities | Shuo Zhang; Krisztian Balog; Jamie Callan; | To aid knowledge editors in the manual process of expanding a category system, this paper presents a method of generating categories for sets of entities. |
184 | CommDGI: Community Detection Oriented Deep Graph Infomax | Tianqi Zhang; Yun Xiong; Jiawei Zhang; Yao Zhang; Yizhu Jiao; Yangyong Zhu; | To overcome the shortcomings of general-purposed graph representation learning methods, we propose the Community Deep Graph Infomax (CommDGI), a graph neural network designed to handle community detection problems. |
185 | Spatial-Temporal Convolutional Graph Attention Networks for Citywide Traffic Flow Forecasting | Xiyue Zhang; Chao Huang; Yong Xu; Lianghao Xia; | To address these challenges, we propose a new traffic prediction framework–Spatial-Temporal Convolutional Graph Attention Network (ST-CGA), to enable the traffic prediction with the modeling of region dependencies, from locally to globally in a comprehensive manner. |
186 | Semi-Supervised Graph-to-Graph Translation | Tianxiang Zhao; Xianfeng Tang; Xiang Zhang; Suhang Wang; | Therefore,in this work, we seek to provide a graph translation model in the semi-supervised scenario. |
187 | Error-Bounded Graph Anomaly Loss for GNNs | Tong Zhao; Chuchen Deng; Kaifeng Yu; Tianwen Jiang; Daheng Wang; Meng Jiang; | In this work, we propose a novel loss function to train GNNs for anomaly-detectable node representations. |
188 | Whole-Chain Recommendations | Xiangyu Zhao; Long Xia; Lixin Zou; Hui Liu; Dawei Yin; Jiliang Tang; | In this paper, we study the recommendation problem with multiple (consecutive) scenarios, i.e., whole-chain recommendations. |
189 | S3-Rec: Self-Supervised Learning for Sequential Recommendation with Mutual Information Maximization | Kun Zhou; Hui Wang; Wayne Xin Zhao; Yutao Zhu; Sirui Wang; Fuzheng Zhang; Zhongyuan Wang; Ji-Rong Wen; | To tackle this problem, we propose the model S3-Rec, which stands for Self-Supervised learning for Sequential Recommendation, based on the self-attentive neural architecture. |
190 | Top-k Graph Summarization on Hierarchical DAGs | Xuliang Zhu; Xin Huang; Byron Choi; Jianliang Xu; | In this paper, we study a new problem of finding k representative vertices to summarize a hierarchical DAG. |
191 | When Structure Meets Keywords: Cohesive Attributed Community Search | Yuanyuan Zhu; Jian He; Junhao Ye; Lu Qin; Xin Huang; Jeffrey Xu Yu; | In this paper, we propose a new cohesive attributed community (CAC) model that can ensure both structure cohesiveness and attribute cohesiveness of communities. |
192 | LRHNE: A Latent-Relation Enhanced Embedding Method for Heterogeneous Information Networks | Zhihua Zhu; Xinxin Fan; Xiaokai Chu; Jianhui Huang; Jingping Bi; | Considering the sophisticated correlations in HINs, we in this paper propose a novel HIN embedding method LRHNE to yield latent-relation enhanced embeddings for nodes. |
193 | Corpus Bootstrapping for Assessment of the Properties of Effectiveness Measures | Justin Zobel; Lida Rashidi; | In this work we consider how bootstrapping can be used to assess the reliability of effectiveness measures for experimental IR. |
194 | Building Test Collections using Bandit Techniques: A Reproducibility Study | Bahadir Altun; Mucahid Kutlu; | In this paper, we conduct a comprehensive set of experiments to evaluate six bandit-based document selection methods, in terms of evaluation reliability, fairness, and reusability of the resultant test collections. |
195 | FDCM: Towards Balanced and Generalizable Concept-based Models for Effective Medical Ranking | Mohammad Bahrani; Thomas Roelleke; | We propose a new aggregation parameter to combine conceptual and term-based Dirichlet Compound Model scores effectively. |
196 | Application Performance Anomaly Detection with LSTM on Temporal Irregularities in Logs | Xavier Baril; Oihana Coustié; Josiane Mothe; Olivier Teste; | In this paper, we develop a new model for performance anomaly detection that captures temporal deviations from the nominal model, by means of a sliding window data representation. |
197 | Automatic Gaussian Process Model Retrieval for Big Data | Fabian Berns; Christian Beecks; | Since non-approximative Gaussian Processes only allow for processing small datasets with low statistical versatility, we propose a new approach that allows to efficiently and automatically retrieve GPMs for large-scale data. |
198 | Query Abandonment Prediction with Recurrent Neural Models of Mouse Cursor Movements | Lukas Brückner; Ioannis Arapakis; Luis A. Leiva; | We show that mouse cursor movements make a valuable, low-cost behavioral signal that can discriminate good and bad abandonment. |
199 | NumClaim: Investor’s Fine-grained Claim Detection | Chung-Chi Chen; Hen-Hsen Huang; Hsin-Hsi Chen; | In this paper, we design a novel task for argument mining in the financial domain, and provide an expert-annotated dataset, NumClaim, for the proposed task. |
200 | Label-Aware Graph Convolutional Networks | Hao Chen; Yue Xu; Feiran Huang; Zengde Deng; Wenbing Huang; Senzhang Wang; Peng He; Zhoujun Li; | In this paper, we consider the problem of node classification and propose the Label-Aware Graph Convolutional Network (LAGCN) framework which can directly identify valuable neighbors to enhance the performance of existing GCN models. |
201 | Graph Unfolding Networks | Hao Chen; Wenbing Huang; Yue Xu; Fuchun Sun; Zhoujun Li; | In this paper, we propose Graph Unfolding Networks (GUNets) as an alternative mechanism of recursive neighborhood aggregation for graph representation learning. |
202 | CONE-Align: Consistent Network Alignment with Proximity-Preserving Node Embedding | Xiyuan Chen; Mark Heimann; Fatemeh Vahedian; Danai Koutra; | To improve this, we propose CONE-Align, which models intra-network proximity with node embeddings and uses them to match nodes across networks after aligning the embedding subspaces. |
203 | Generative Adversarial Attributed Network Anomaly Detection | Zhenxing Chen; Bo Liu; Meiqing Wang; Peng Dai; Jun Lv; Liefeng Bo; | In this paper, we propose a generative adversarial attributed network (GAAN) anomaly detection framework. |
204 | Joint Estimation of User And Publisher Credibility for Fake News Detection | Rajdipa Chowdhury; Sriram Srinivasan; Lise Getoor; | In this work, we look at fake news detection as a problem of estimating the credibility of both the news publishers and users that propagate news articles. |
205 | Learning Discriminative Virtual Sequences for Time Series Classification | Abhilash Dorle; Fangyu Li; Wenzhan Song; Sheng Li; | To address this issue, in this paper, we propose a novel time series classification method named Discriminative Virtual Sequence Learning (DVSL). |
206 | DECWA: Density-Based Clustering using Wasserstein Distance | Nabil El Malki; Robin Cugny; Olivier Teste; Franck Ravat; | The key idea we propose is to use the Wasserstein metric, a powerful tool to measure the distance between p.d.f of sub-clusters. |
207 | Why is That a Background Article: A Qualitative Analysis of Relevance for News Background Linking | Marwa Essam; Tamer Elsayed; | The goal of the study is to shed some light on the relationship between the query articles and the background articles, and provide informative insights for developing more effective background retrieval models. |
208 | Hybrid Dynamic Pruning for Efficient and Effective Query Processing | Wenxiu Fang; Trent G. Marbach; Gang Wang; Xiaoguang Liu; | In this work, we investigate the performance of the main dynamic pruning algorithms in terms of average and tail latency as well as the accuracy of query results, and find that they are complementary. |
209 | Sample Optimization For Display Advertising | Hongliang Fei; Shulong Tan; Pengju Guo; Wenbo Zhang; Hongfang Zhang; Ping Li; | In this paper, we utilize several sample optimization strategies to alleviate the covariate shift problem for training candidate generation models. |
210 | A Reinforced Semi-supervised Neural Network for Helpful Review Identification | Yue Feng; Miao Fan; Mingming Sun; Ping Li; | Therefore, this paper proposes a reinforced semi-supervised neural learning method (abbreviated as RSSNL) for helpful review identification, which can automatically select high-related unlabeled reviews to help training. |
211 | A View-Adversarial Framework for Multi-View Network Embedding | Dongqi Fu; Zhe Xu; Bo Li; Hanghang Tong; Jingrui He; | In this paper, we propose a view-adversarial framework to generate comprehensive and robust multi-view network representations named VANE, which is based on two adversarial games. |
212 | Can Adversarial Weight Perturbations Inject Neural Backdoors | Siddhant Garg; Adarsh Kumar; Vibhor Goel; Yingyu Liang; | In this work we extend the idea of "adversarial perturbations" to the space of model weights, specifically to inject backdoors in trained DNNs, which exposes a security risk of publicly available trained models. |
213 | Estimating Topic Difficulty Using Normalized Discounted Cumulated Gain | Lukas Gienapp; Benno Stein; Matthias Hagen; Martin Potthast; | We introduce a new approach to estimate topic difficulty, which is based on the ratio of systems that achieve an NDCG score that is better than a baseline formed as random ranking of the pool of judged documents. |
214 | The Impact of Negative Relevance Judgments on NDCG | Lukas Gienapp; Maik Fröbe; Matthias Hagen; Martin Potthast; | We show that, instead of zeroing negative labels, a min-max-normalization of NDCG retains its statistical power while improving its reliability and stability. |
215 | Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots | Jia-Chen Gu; Tianda Li; Quan Liu; Zhen-Hua Ling; Zhiming Su; Si Wei; Xiaodan Zhu; | In this paper, we study the problem of employing pre-trained language models for multi-turn response selection in retrieval-based chatbots. |
216 | Subsampled Randomized Hadamard Transform for Regression of Dynamic Graphs | Mostafa Haghir Chehreghani; | In this paper, we utilize subsampled randomized Hadamard transform to propose a randomized algorithm for dynamic graphs. |
217 | Learning to Form Skill-based Teams of Experts | Radin Hamidi Rad; Hossein Fani; Mehdi Kargar; Jaroslaw Szlichta; Ebrahim Bagheri; | Instead of forming a large network of experts, we propose to learn relationships among experts and skills through a variational Bayes neural architecture wherein: i) we consider all past team compositions as training instances to predict future teams; ii) we bring scalability for large networks of experts due to the neural architecture; and, iii) we address sparsity by incorporating uncertainty on the neural network’s parameters which yields a richer representation and more accurate team composition. |
218 | GAEAT: Graph Auto-Encoder Attention Networks for Knowledge Graph Completion | Yanfei Han; Quan Fang; Jun Hu; Shengsheng Qian; Changsheng Xu; | In this paper, we present a new approach for knowledge graph completion called GAEAT (Graph Auto-encoder Attention Network Embedding), which can encapsulate both entity and relation features. |
219 | Learning to Re-Rank with Contextualized Stopwords | Sebastian Hofstätter; Aldo Lipani; Markus Zlabinger; Allan Hanbury; | In this work we propose a novel contextualized stopword detection mechanism for neural re-ranking models. |
220 | DATSING: Data Augmented Time Series Forecasting with Adversarial Domain Adaptation | Hailin Hu; MingJian Tang; Chengcheng Bai; | In this work, we have developed, DATSING, a transfer learning-based framework that effectively leverages cross-domain time series latent representations to augment target domain forecasting. |
221 | Homogenization with Explicit Semantics Preservation for Heterogeneous Information Network | Tiancheng Huang; Zifeng Zhuang; Shanshan Zhang; Donglin Wang; | In this paper, a novel explicit homogenization method is proposed to preserve more semantic information, where the latent information of intermediate nodes among each meta-path instance and that among multiple meta-path instances are incorporated into the conventional adjacent matrix (or weight matrix). |
222 | DistilSum:: Distilling the Knowledge for Extractive Summarization | Ruipeng Jia; Yanan Cao; Haichao Shi; Fang Fang; Yanbing Liu; Jianlong Tan; | In this paper, we introduce DistilSum, which contains teacher mechanism and student model. |
223 | T-REX: A Topic-Aware Relation Extraction Model | Woohwan Jung; Kyuseok Shim; | To tackle the problem, we propose a Topic-aware Relation EXtraction (T-REX) model. |
224 | CR-Graph: Community Reinforcement for Accurate Community Detection | Yoonsuk Kang; Jun Seok Lee; Won-Yong Shin; Sang-Wook Kim; | In this paper, we present CR-Graph (community reinforcement on graphs), a novel method that helps existing algorithms to perform more-accurate community detection (CD). |
225 | What Rankers Can be Statistically Distinguished in Multileaved Comparisons? | Makoto P. Kato; Akiomi Nishida; Tomohiro Manabe; Sumio Fujita; Takehiro Yamamoto; | This paper presents findings from an empirical study of multileaved comparisons, an efficient online evaluation methodology, in a commercial Web service. |
226 | A Synopses Data Engine for Interactive Extreme-Scale Analytics | Antonis Kontaxakis; Nikos Giatrakos; Antonios Deligiannakis; | We detail the novel architecture of a Synopses Data Engine (SDE) which combines the virtues of parallel processing and stream summarization towards interactive analytics at scale. |
227 | NASE:: Learning Knowledge Graph Embedding for Link Prediction via Neural Architecture Search | Xiaoyu Kou; Bingfeng Luo; Huang Hu; Yan Zhang; | In this paper, we propose a novel Neural Architecture Search (NAS) framework for the link prediction task. |
228 | Ranking Clarification Questions via Natural Language Inference | Vaibhav Kumar; Vikas Raunak; Jamie Callan; | For the task of ranking clarification questions, we hypothesize that determining whether a clarification question pertains to a missing entry in a given post (on QA forums such as StackExchange) could be considered as a special case of Natural Language Inference (NLI), where both the post and the most relevant clarification question point to a shared latent piece of information or context. |
229 | MetaTPOT: Enhancing A Tree-based Pipeline Optimization Tool Using Meta-Learning | Doron Laadan; Roman Vainshtein; Yarden Curiel; Gilad Katz; Lior Rokach; | In this study, we explore TPOT’s GP process and propose MetaTPOT, an enhanced variant that uses a meta learning-based approach to predict the performance of TPOT’s pipeline candidates. |
230 | Rethinking Operators Placement of Stream Data Application in the Edge | Thomas Lambert; David Guyon; Shadi Ibrahim; | In this paper, we argue that MST should be used as an optimization objective when placing operators. |
231 | An Index Advisor Using Deep Reinforcement Learning | Hai Lan; Zhifeng Bao; Yuwei Peng; | We study the problem of index selection to maximize the workload performance, which is critical to database systems. |
232 | Bridging the Gap between Click and Relevance for Learning-to-Rank with Minimal Supervision | Jae-woong Lee; Young-In Song; Deokmin Haam; Sanghoon Lee; Woo-sik Choi; Jongwuk Lee; | From a different perspective, we propose a simple-yet-effective ranking model, namely wLambdaMART, which estimates the confidence of click data with a few labeled data, instead of learning the propensity weight to reduce the bias from click data. |
233 | Are Negative Links Really Beneficial to Network Embedding?: In-Depth Analysis and Interesting Results | Yeon-Chang Lee; Nayoun Seo; Sang-Wook Kim; | In this paper, we start by pointing out the limitations on the validation of existing signed network embedding (NE) methods. |
234 | Non-local Self-attentive Autoencoder for Genetic Functionality Prediction | Yun Li; Zhe Liu; Lina Yao; Zihuai He; | To tackle this problem, we propose Non-local Self-attentive Autoencoder (NSAE) which applies attention-driven genetic variant modelling. |
235 | Recursive Balanced k-Subset Sum Partition for Rule-constrained Resource Allocation | Zhuo Li; Jiannong Cao; Zhongyu Yao; Wengen Li; Yu Yang; Jia Wang; | To address these limitations, we propose recursive balanced k-subset sum partition (RBkSP), in which iterative ‘cut-one-out’ policy is employed that in each round, only one subset whose weight of tasks sums up to 1/k of the total weight of all tasks is taken out from the set. |
236 | Alike and Unlike: Resolving Class Imbalance Problem in Financial Credit Risk Assessment | Yang Liu; Xiang Ao; Qiwei Zhong; Jinghua Feng; Jiayu Tang; Qing He; | In this paper, we propose a novel adversarial data augmentation method to solve the class imbalance problem in financial credit risk assessment. |
237 | Active Query of Private Demographic Data for Learning Fair Models | Yijun Liu; Chao Lan; | In this paper, we propose a new direction called active demographic query. |
238 | Neural Relation Extraction on Wikipedia Tables for Augmenting Knowledge Graphs | Erin Macdonald; Denilson Barbosa; | We help close this gap with a neural method that uses contextual information surrounding a table in a Wikipedia article to extract relations between entities appearing in the same row of a table or between the entity of said article and entities appearing in the table. |
239 | Fairness-Aware Learning with Prejudice Free Representations | Ramanujam Madhavan; Mohit Wadhwa; | In this paper, we propose a novel algorithm that can effectively identify and treat latent discriminating features. |
240 | A Comparison of Top-k Threshold Estimation Techniques for Disjunctive Query Processing | Antonio Mallia; Michal Siedlaczek; Mengyang Sun; Torsten Suel; | In this paper, we address this issue by reimplementing four major approaches and comparing them in terms of estimation error, running time, likelihood of an overestimate, and end-to-end performance when applied to common classes of disjunctive top-k query processing algorithms. |
241 | Feedback Loop and Bias Amplification in Recommender Systems | Masoud Mansoury; Himan Abdollahpouri; Mykola Pechenizkiy; Bamshad Mobasher; Robin Burke; | In this paper, we propose a method for simulating the users interaction with the recommenders in an offline setting and study the impact of feedback loop on the popularity bias amplification of several recommendation algorithms. |
242 | Diversifying Top-k Point-of-Interest Queries via Collective Social Reach | Stella Maropaki; Sean Chester; Christos Doulkeridis; Kjetil Nørvåg; | Algorithmically, evaluating this set-based notion of diversity is challenging, yet we present several effective algorithms based on (integer) linear programming, a greedy framework, and r-tree distance browsing. |
243 | Transformer Models for Recommending Related Questions in Web Search | Rajarshee Mitra; Manish Gupta; Sandipan Dandapat; | In this paper, we present our system which is based on a Transformer-based neural representation, BERT (Bidirectional Encoder Representations from Transformers), for query, question and corresponding search result snippets. |
244 | Evaluating the Impact of Knowledge Graph Context on Entity Disambiguation Models | Isaiah Onando Mulang’; Kuldeep Singh; Chaitali Prabhu; Abhishek Nadgeri; Johannes Hoffart; Jens Lehmann; | In this paper, we argue that context derived from a knowledge graph (in our case: Wikidata) provides enough signals to inform pretrained transformer models and improve their performance for named entity disambiguation (NED) on Wikidata KG. |
245 | Deep Metric Learning Based on Rank-sensitive Optimization of Top-k Precision | Naoki Muramoto; Hai-Tao Yu; | In this paper, we propose a new method on how to optimize top-k precision in a rank-sensitive manner. |
246 | Gated Heterogeneous Graph Representation Learning for Shop Search in E-commerce | Xichuan Niu; Bofang Li; Chenliang Li; Rong Xiao; Haochuan Sun; Honggang Wang; Hongbo Deng; Zhenzhong Chen; | In this work, we focus on vectorized search matching model for shop search in Taobao. |
247 | A Reproducibility Study of Deep and Surface Machine Learning Methods for Human-related Trajectory Prediction | Bardh Prenkaj; Paola Velardi; Damiano Distante; Stefano Faralli; | In this paper, we compare several deep and surface state-of-the-art machine learning methods for risk prediction in problems that can be modelled as a trajectory of events separated by irregular time intervals. |
248 | CGTR: Convolution Graph Topology Representation for Document Ranking | Yuanyuan Qi; Jiayue Zhang; Yansong Liu; Weiran Xu; Jun Guo; | In this paper, we take the advantage of Graph Convolutional Networks (GCN) to model global word-relation structure of a document to improve context-aware document ranking. |
249 | Representative Negative Instance Generation for Online Ad Targeting | Yuhan Quan; Jingtao Ding; Depeng Jin; Jianbo Yang; Xing Zhou; Yong Li; | In this work, we tackle this problem by improving the quality of negative instances used in training the targeting model. |
250 | Training Sensitivity in Graph Isomorphism Network | Md. Khaledur Rahman; | In this paper, we attempt to fill this gap by studying various alternative functions for a respective module using a diverse set of benchmark datasets. |
251 | Securing Bloom Filters for Privacy-preserving Record Linkage | Thilina Ranbaduge; Rainer Schnell; | In this paper we propose two novel techniques that can be applied on BF encoding to improve privacy against attacks. |
252 | Product Insights: Analyzing Product Intents in Web Search | Nikitha Rao; Chetan Bansal; Subhabrata Mukherjee; Chandra Maddila; | In this work, we study search logs from Bing web search engine to characterize user intents and study user behavior for product search. |
253 | Muse: Multi-query Event Trend Aggregation | Allison Rozet; Olga Poppe; Chuan Lei; Elke A. Rundensteiner; | We propose MUSE (Multi-query Shared Event trend aggregation), the first framework that shares aggregation queries with Kleene patterns while avoiding expensive trend construction. |
254 | Distant Supervision in BERT-based Adhoc Document Retrieval | Koustav Rudra; Avishek Anand; | In this paper, we handle both the issues simultaneously and introduce passage level weak supervision in contrast to standard document level supervision. |
255 | Modelling Regional Crime Risk using Directed Graph of Check-ins | Shakila Khan Rumi; Flora D. Salim; | The mobility dynamics inferred from Foursquare helps us understanding urban social events like crime In this paper, we propose a directed graph from the aggregated movement between regions using Foursquare data. |
256 | Relation Extraction with Self-determined Graph Convolutional Network | Sunil Kumar Sahu; Derek Thomas; Billy Chiu; Neha Sengupta; Mohammady Mahdy; | In this work, we propose a novel model, the Self-determined Graph Convolutional Network (SGCN), which determines a weighted graph using a self-attention mechanism, rather using any linguistic tool. |
257 | A Framework for Analyzing the Impact of Missing Data in Predictive Models | Fabiola Santore; Eduardo C. de Almeida; Wagner H. Bonat; Eduardo H. M. Pena; Luiz Eduardo S. de Oliveira; | We propose a stochastic framework to evaluate the impact of missing data on the performance of predictive models. |
258 | Deep Adaptive Feature Aggregation in Multi-task Convolutional Neural Networks | Zhen Shen; Chaoran Cui; Jin Huang; Jian Zong; Meng Chen; Yilong Yin; | In this paper, we propose a novel Adaptive Feature Aggregation (AFA) layer for multi-task CNNs, in which a dynamic aggregation mechanism is designed to allow each task to adaptively determine the degree to which the feature aggregation of different tasks is needed according to the feature dependencies. |
259 | GGDs: Graph Generating Dependencies | Larissa C. Shimomura; George Fletcher; Nikolay Yakovets; | We propose Graph Generating Dependencies (GGDs), a new class of dependencies for property graphs. |
260 | Do You Really Like Her Post?: Network-Based Analysis for Understanding Like Activities in SNS | Junho Song; Hyekyoung Park; Kyungsik Han; Sang-Wook Kim; | In this paper, we build like networks in Instagram and analyze them through the lens of two salient aspects – friendship and interest – that constitute social networks. |
261 | DREAM: A Dynamic Relation-Aware Model for Social Recommendation | Liqiang Song; Ye Bi; Mengqiu Yao; Zhenyu Wu; Jianming Wang; Jing Xiao; | In this paper, we propose a unified framework named Dynamic RElation-Aware Model (DREAM) for social recommendation, which tries to model both users? |
262 | LogBug: Generating Adversarial System Logs in Real Time | Jingyu Sun; Bingyu Liu; Yuan Hong; | In this paper, to our best knowledge, we take the first step to propose a novel real-time black-box attack framework LogBug in which attackers slightly modify the logs to deviate the analysis result (i.e., evading the anomaly detection) without knowing the learning model and parameters of the log parser. |
263 | TABLE: A Task-Adaptive BERT-based ListwisE Ranking Model for Document Retrieval | Xingwu Sun; Hongyin Tang; Fuzheng Zhang; Yanling Cui; Beihong Jin; Zhongyuan Wang; | In the pre-training stage of TABLE, we present a domain-adaptive strategy. |
264 | DynamicRec: A Dynamic Convolutional Network for Next Item Recommendation | Md Mehrab Tanjim; Hammad A. Ayuubi; Garrison W. Cottrell; | In this paper, we argue that all the dynamics of the item-to-item transition in session-based settings may not be observable at training time. |
265 | Schema-Agnostic Entity Matching using Pre-trained Language Models | Kai-Sheng Teong; Lay-Ki Soon; Tin Tin Su; | In view of this challenge, this paper presents an effective approach for schema-agnostic EM, where having schema-aligned tables is not compulsory. |
266 | Denoising Individual Bias for Fairer Binary Submatrix Detection | Changlin Wan; Wennan Chang; Tong Zhao; Sha Cao; Chi Zhang; | We propose a binary data denoising framework, namely BIND, which optimizes the detection of true patterns by estimating the row- or column-wise mixture distribution of patterns and disparate background, and eliminating the binary attributes that are more likely from the background. |
267 | Dual Autoencoder Network with Swap Reconstruction for Cold-Start Recommendation | Bei Wang; Chenrui Zhang; Hao Zhang; Xiaoqing Lyu; Zhi Tang; | In this paper, we propose a Dual Autoencoder Network (DAN), which implements cross-domain recommendations to cold-start users in an end-to-end manner. |
268 | Embedding Node Structural Role Identity into Hyperbolic Space | Lili Wang; Ying Lu; Chenghan Huang; Soroush Vosoughi; | In this work, we are the first to present a framework to embed the structural roles of nodes into hyperbolic space. |
269 | Calibration of Google Trends Time Series | Robert West; | We address this issue by proposing Google Trends Anchor Bank (G-TAB), an efficient solution for the calibration of Google Trends data. |
270 | Tolerant Markov Boundary Discovery for Feature Selection | Xingyu Wu; Bingbing Jiang; Yan Zhong; Huanhuan Chen; | In this paper, we propose a tolerant MB discovery algorithm (TLMB), which maps the feature space and target space to a reproducing kernel Hilbert space through the conditional covariance operator, to measure the causal information carried by a feature. |
271 | Deep Multi-Interest Network for Click-through Rate Prediction | Zhibo Xiao; Luwei Yang; Wen Jiang; Yi Wei; Yi Hu; Hao Wang; | In this paper, we propose a novel method named as Deep Multi-Interest Network (DMIN) which models user’s latent multiple interests for click-through rate prediction task. |
272 | Learning to Generate Reformulation Actions for Scalable Conversational Query Understanding | Zihan Xu; Jiangang Zhu; Ling Geng; Yang Yang; Bojia Lin; Daxin Jiang; | For action generation, we propose a multi-task learning framework enhanced by coreference resolution, and introduce grammar constraints into the decoding process. |
273 | Enhance Prototypical Network with Text Descriptions for Few-shot Relation Classification | Kaijia Yang; Nantao Zheng; Xinyu Dai; Liang He; Shujian Huang; Jiajun Chen; | In this paper, we propose TD-Proto, which enhances prototypical network with relation and entity descriptions. |
274 | Analysis of Multivariate Scoring Functions for Automatic Unbiased Learning to Rank | Tao Yang; Shikai Fang; Shibo Li; Yulan Wang; Qingyao Ai; | In this paper, we investigate existing multivariate scoring functions and AutoULTR algorithms in theory and prove that permutation invariance is a crucial factor that determines whether a context-aware learning-to-rank model could be applied to existing AutoULTR framework. |
275 | Time-aware Graph Relational Attention Network for Stock Recommendation | Xiaoting Ying; Cong Xu; Jianliang Gao; Jianxin Wang; Zhao Li; | In this paper, we propose a time-aware graph relational attention network (TRAN) for stock recommendation based on return ratio ranking. |
276 | Deep Interaction Machine: A Simple but Effective Model for High-order Feature Interactions | Feng Yu; Zhaocheng Liu; Qiang Liu; Haoli Zhang; Shu Wu; Liang Wang; | Accordingly, we propose a novel Interaction Machine (IM) model. |
277 | Few-shot Insider Threat Detection | Shuhan Yuan; Panpan Zheng; Xintao Wu; Hanghang Tong; | In this work, we propose a novel framework combining the idea of self-supervised pre-training and metric-based few-shot learning to detect insiders. |
278 | Leveraging User Email Actions to Improve Ad-Close Prediction | Oleg Zendel; Yaroslav Fyodorov; Fiana Raiber; Natalia Silberstein; Oren Somekh; Ali Tabaja; | In this work, we focus on email since this is the property in which most ad closes occur. |
279 | Event-Driven Network for Cross-Modal Retrieval | Zhixiong Zeng; Nan Xu; Wenji Mao; | To tackle the challenge, in this paper, we focus on the retrieval task on long text and image, and propose an event-driven network for cross-modal retrieval. |
280 | Integrating Diagnosis Rules into Deep Neural Networks for Bladder Cancer Staging | Cheng Zhang; Xiaodong Yue; Yufei Chen; Ying Lv; | To tackle the problems, we construct the diagnosis rules of bladder cancer staging based on the clinical experiences of tumor penetration into bladder wall. |
281 | Hyper-Substructure Enhanced Link Predictor | Jian Zhang; Jun Zheng; Jinyin Chen; Qi Xuan; | In this paper, we propose an end-to-end deep learning framework, namely hyper-substructure enhanced link predictor (HELP), for link prediction. |
282 | Seasonal-Periodic Subgraph Mining in Temporal Networks | Qianzhen Zhang; Deke Guo; Xiang Zhao; Xinyi Li; Xi Wang; | This motivates us to explore mining seasonal-periodic subgraphs, and the investigation presents a novel model, called maximal σ-periodic $ømega$-seasonal k-subgraph. |
283 | Multiplex Graph Neural Networks for Multi-behavior Recommendation | Weifeng Zhang; Jingwen Mao; Yi Cao; Congfu Xu; | In this paper, a more general model named Multiplex Graph Neural Network (MGNN) is proposed as a remedy. |
284 | Robust Normalized Squares Maximization for Unsupervised Domain Adaptation | Wenju Zhang; Xiang Zhang; Qing Liao; Wenjing Yang; Long Lan; Zhigang Luo; | In this paper, we propose a novel normalized squares maximization (NSM) loss in which the maximum squares is normalized by the sum of squares of class sizes. |
285 | Community Identification in Signed Networks: A K-Truss Based Model | Jun Zhao; Renjie Sun; Qiuyu Zhu; Xiaoyang Wang; Chen Chen; | To better describe the communities, we propose a novel model, named signed k-truss, which leverages the properties of k-truss and balanced triangle. |
286 | An Event-Oriented Neural Ranking Model for News Retrieval | Lin Zhao; Wanhui Qian; Liangjun Zang; Fuqing Zhu; Yijun Lu; Ruixuan Li; Jizhong Han; Songlin Hu; | In this paper, we propose a novel and effective event-oriented neural ranking model for news retrieval (ENRMNR). |
287 | Revisiting Alternative Experimental Settings for Evaluating Top-N Item Recommendation Algorithms | Wayne Xin Zhao; Junhua Chen; Pengfei Wang; Qi Gu; Ji-Rong Wen; | In this paper, we revisit alternative experimental settings for evaluating top-N recommendation algorithms, considering three important factors, namely dataset splitting, sampled metrics and domain selection. |
288 | Dimension Relation Modeling for Click-Through Rate Prediction | Zihao Zhao; Zhiwei Fang; Yong Li; Changping Peng; Yongjun Bao; Weipeng Yan; | In this paper, we find it has clear positive effects on CTR prediction to model such relations and propose a novel Dimension Relation Module (DRM) to capture them through dimension recalibration. |
289 | On-demand Influencer Discovery on Social Media | Cheng Zheng; Qin Zhang; Sean Young; Wei Wang; | To alleviate this limitation, we investigate an on-Demand Influencer Discovery (DID) framework that is able to identify influencers on any subject depicted by a few user-specified keywords, regardless of its popularity on social media. |
290 | Data Augmentation for Graph Classification | Jiajun Zhou; Jie Shen; Qi Xuan; | Towards this, we introduce data augmentation on graphs and present two heuristic algorithms: \emrandom mapping and \emmotif-similarity mapping, to generate more weakly labeled data for small-scale benchmark datasets via heuristic modification of graph structures. |
291 | Diversifying Multi-aspect Search Results Using Simpson’s Diversity Index | Jianghong Zhou; Eugene Agichtein; Surya Kallumadi; | To address this problem, we introduce a novel method by adapting the Simpson’s Diversity Index from biology, which enables a more effective and efficient quadratic search result diversification algorithm. |
292 | Leveraging Historical Interaction Data for Improving Conversational Recommender System | Kun Zhou; Wayne Xin Zhao; Hui Wang; Sirui Wang; Fuzheng Zhang; Zhongyuan Wang; Ji-Rong Wen; | For this purpose, we propose a novel pre-training approach to integrating both item-based preference sequence (from historical interaction data) and attribute-based preference sequence (from conversation data) via pre-training methods. |
293 | Behavior-driven Student Performance Prediction with Tri-branch Convolutional Neural Network | Jian Zong; Chaoran Cui; Yuling Ma; Li Yao; Meng Chen; Yilong Yin; | In this paper, we seek to address the problem by analyzing students’ daily studying and living behavior, which is comprehensively recorded via campus smart cards. |
294 | Multimodal Clustering via Deep Commonness and Uniqueness Mining | Linlin Zong; Faqiang Miao; Xianchao Zhang; Bo Xu; | In this paper, we enhance the mining of modality-common knowledge by extracting the modality-unique knowledge of each modality simultaneously. |
295 | An Empirical Study on Clarifying Question-Based Systems | Jie Zou; Evangelos Kanoulas; Yiqun Liu; | In this work, we conduct an online experiment by deploying an experimental system, which interacts with users by asking clarifying questions against a product repository. |
296 | AutoADR: Automatic Model Design for Ad Relevance | Yiren Chen; Yaming Yang; Hong Sun; Yujing Wang; Yu Xu; Wei Shen; Rong Zhou; Yunhai Tong; Jing Bai; Ruofei Zhang; | In this paper, we propose AutoADR (Automatic model design for AD Relevance) — a novel end-to-end framework to address this challenge, and share our experience to ship these cutting-edge techniques into online Ad Relevance system at Microsoft Bing. |
297 | U-rank: Utility-oriented Learning to Rank with Implicit Feedback | Xinyi Dai; Jiawei Hou; Qing Liu; Yunjia Xi; Ruiming Tang; Weinan Zhang; Xiuqiang He; Jun Wang; Yong Yu; | To this end, we propose a novel ranking framework called U-rank that directly optimizes the expected utility of the ranking list. |
298 | Personalized Bundle Recommendation in Online Games | Qilin Deng; Kai Wang; Minghao Zhao; Zhene Zou; Runze Wu; Jianrong Tao; Changjie Fan; Liang Chen; | In this paper, we target at a practical but less explored recommendation problem named bundle recommendation, which aims to offer a combination of items to users. |
299 | Learning Formatting Style Transfer and Structure Extraction for Spreadsheet Tables with a Hybrid Neural Network Architecture | Haoyu Dong; Jiong Yang; Shi Han; Dongmei Zhang; | In this paper, we propose techniques for table formatting style transfer, i.e., to automatically format a target table according to the style of a reference table. |
300 | The Utility of Context When Extracting Entities From Legal Documents | Jonathan Donnelly; Adam Roegiest; | Inspired by previous work in Named Entity Recognition (NER), we investigate how NER techniques can be leveraged to aid lawyers in this review process. |
301 | Learning to Rank in the Position Based Model with Bandit Feedback | Beyza Ermis; Patrick Ernst; Yannik Stein; Giovanni Zappella; | We propose novel extensions of two well-known algorithms viz. |
302 | Fusing Global Domain Information and Local Semantic Information to Classify Financial Documents | Mengzhen Fan; Dawei Cheng; Fangzhou Yang; Siqiang Luo; Yifeng Luo; Weining Qian; Aoying Zhou; | In this paper, we implement a document classification framework, named GraphSEAT, to classify financial documents for a leading financial information service provider in China. |
303 | MTBRN: Multiplex Target-Behavior Relation Enhanced Network for Click-Through Rate Prediction | Yufei Feng; Fuyu Lv; Binbin Hu; Fei Sun; Kun Kuang; Yang Liu; Qingwen Liu; Wenwu Ou; | In this paper, we propose a new framework named Multiplex Target-Behavior Relation enhanced Network (MTBRN) to leverage multiplex relations between user behaviors and target item to enhance CTR prediction. |
304 | Fine-Tuned Compressed Representations of Vessel Trajectories | Giannis Fikioris; Kostas Patroumpas; Alexander Artikis; Georgios Paliouras; Manolis Pitsikalis; | In this work, we present an extension of this system, that allows the user to fine-tune trajectory compression according to the requirements of a given application. |
305 | Intent-Driven Similarity in E-Commerce Listings | Gilad Fuchs; Yoni Acriche; Idan Hasson; Pavel Petrov; | In this paper we present a method we call "Listing2Query", or "L2Q", which uses a Sequence Labeling approach to learn token importance from our users? |
306 | Impression Pacing for Jobs Marketplace at LinkedIn | Sahin Cem Geyik; Luthfur Chowdhury; Florian Raudies; Wen Pu; Jianqiang Shen; | In this paper, we propose an impression-based spend computation system, hence an impression-based pacing scheme. |
307 | Bid Shading in The Brave New World of First-Price Auctions | Djordje Gligorijevic; Tian Zhou; Bharatbhushan Shetty; Brendan Kitts; Shengjun Pan; Junwei Pan; Aaron Flores; | In this study, we propose a machine learning approach of modeling optimal bid shading for non-censored online first-price ad auctions. |
308 | Prospective Modeling of Users for Online Display Advertising via Deep Time-Aware Model | Djordje Gligorijevic; Jelena Gligorijevic; Aaron Flores; | In this study, we propose a novel deep time-aware approach designed to model sequences of users’ activities and capture implicit temporal signals of users’ conversion intents. |
309 | Learning to Profile: User Meta-Profile Network for Few-Shot Learning | Hao Gong; Qifang Zhao; Tianyu Li; Derek Cho; DuyKhuong Nguyen; | 1) Meta-learning model: In the context of representation learning with e-commerce user behavior data, we propose a meta-learning framework called the Meta-Profile Network, which extends the ideas of matching network and relation network for knowledge transfer and fast adaptation; 2) Encoding strategy: To keep high fidelity of large-scale long-term sequential behavior data, we propose a time-heatmap encoding strategy that allows the model to encode data effectively; 3) Deep network architecture: A multi-modal model combined with multi-task learning architecture is utilized to address the cross-domain knowledge learning and insufficient label problems. |
310 | EdgeRec: Recommender System on Edge in Mobile Taobao | Yu Gong; Ziwen Jiang; Yufei Feng; Binbin Hu; Kaiqi Zhao; Qingwen Liu; Wenwu Ou; | Our work, to our best knowledge, is the first attempt to design and implement the novel Recommender System on Edge (EdgeRec), which achieves Real-time User Perception and Real-time System Feedback. |
311 | Price Forecast with High-Frequency Finance Data: An Autoregressive Recurrent Neural Network Model with Technical Indicators | Yuechun Gu; Da Yan; Sibo Yan; Zhe Jiang; | We propose to adopt an autoregressive recurrent network instead so that the loss of the prediction at every time step contributes to the model training; we also treat a rich set of technical indicators at each time step as covariates to enhance the model input. |
312 | Deep Multifaceted Transformers for Multi-objective Ranking in Large-Scale E-commerce Recommender Systems | Yulong Gu; Zhuoye Ding; Shuaiqiang Wang; Lixin Zou; Yiding Liu; Dawei Yin; | In this paper, we argue that it is crucial to formulate users’ different interests based on multiple types of behaviors and perform multi-task learning for significant improvement in multiple objectives simultaneously. |
313 | A Deep Prediction Network for Understanding Advertiser Intent and Satisfaction | Liyi Guo; Rui Lu; Haoqi Zhang; Junqi Jin; Zhenzhe Zheng; Fan Wu; Jin Li; Haiyang Xu; Han Li; Wenkai Lu; Jian Xu; Kun Gai; | In this paper, we propose a novel Deep Satisfaction Prediction Network (DSPN), which models advertiser intent and satisfaction simultaneously. |
314 | DeText: A Deep Text Ranking Framework with BERT | Weiwei Guo; Xiaowei Liu; Sida Wang; Huiji Gao; Ananth Sankar; Zimeng Yang; Qi Guo; Liang Zhang; Bo Long; Bee-Chung Chen; Deepak Agarwal; | In this paper, we investigate how to build an efficient BERT-based ranking model for industry use cases. |
315 | P-Companion: A Principled Framework for Diversified Complementary Product Recommendation | Junheng Hao; Tong Zhao; Jin Li; Xin Luna Dong; Christos Faloutsos; Yizhou Sun; Wei Wang; | Given one product, how to recommend its complementary products of different types is the key problem we tackle in this work. |
316 | Loan Default Analysis with Multiplex Graph Learning | Binbin Hu; Zhiqiang Zhang; Jun Zhou; Jingli Fang; Quanhui Jia; Yanming Fang; Quan Yu; Yuan Qi; | To address these issues, we develop a novel attributed multiplex graph based loan default detection approach for effectively integrating multiplex relations in financial scenarios. |
317 | Imbalanced Time Series Classification for Flight Data Analyzing with Nonlinear Granger Causality Learning | Hao Huang; Chenxiao Xu; Shinjae Yoo; Weizhong Yan; Tianyi Wang; Feng Xue; | Here, we present a neural network classification model for imbalanced multivariate time series by leveraging the information learned from normal class, which can also learn the nonlinear Granger causality for each class, so that we can pinpoint how time series classes differ from each other. |
318 | Personalized Flight Itinerary Ranking at Fliggy | Jinhong Huang; Yang Li; Shan Sun; Bufeng Zhang; Jin Huang; | To this end, a novel Personalized Flight itinerary Ranking Network (PFRN) is proposed in this paper. |
319 | Learning Effective Representations for Person-Job Fit by Feature Fusion | Junshu Jiang; Songyun Ye; Wei Wang; Jingran Xu; Xiaosheng Luo; | In this paper, we propose to learn comprehensive and effective representations of the candidates and job posts via feature fusion. |
320 | Incorporating User Feedback into Sequence to Sequence Model Training | Michaeel Kazi; Weiwei Guo; Huiji Gao; Bo Long; | In this work, we propose training a model with both the search history and user feedback datasets. |
321 | Magellan: A Personalized Travel Recommendation System Using Transaction Data | Konik Kothari; Dhruv Gelda; Wei Zhang; Hao Yang; | We present Magellan – a personalized travel recommendation system that is built entirely from card transaction data. |
322 | ART (Attractive Recommendation Tailor): How the Diversity of Product Recommendations Affects Customer Purchase Preference in Fashion Industry? | Hyokmin Kwon; Jaeho Han; Kyungsik Han; | This study examines the impact of the ‘diversity’ of product recommendations on the ‘preference’ of a customer, using online/offline data from a leading fashion company. |
323 | AliMeKG: Domain Knowledge Graph Construction and Application in E-commerce | Feng-Lin Li; Hehong Chen; Guohai Xu; Tian Qiu; Feng Ji; Ji Zhang; Haiqing Chen; | In the paper, we systematically introduce how we construct domain knowledge graph from free text, and demonstrate its business value with several applications. |
324 | Peer-inspired Student Performance Prediction in Interactive Online Question Pools with Graph Neural Network | Haotian Li; Huan Wei; Yong Wang; Yangqiu Song; Huamin Qu; | In this paper, we propose a novel approach using Graph Neural Networks (GNNs) to achieve better student performance prediction in interactive online question pools. |
325 | Spending Money Wisely: Online Electronic Coupon Allocation based on Real-Time User Intent Detection | Liangwei Li; Liucheng Sun; Chenwei Weng; Chengfu Huo; Weijun Ren; | In this paper, we decompose the coupon allocation task into two subtasks: the user intent detection task and the allocation task. |
326 | Improving Multi-Scenario Learning to Rank in E-commerce by Exploiting Task Relationships in the Label Space | Pengcheng Li; Runze Li; Qing Da; An-Xiang Zeng; Lijun Zhang; | To solve the above problem, which we name Multi-Scenario Learning to Rank, we propose the Hybrid of implicit and explicit Mixture-of-Experts (HMoE) approach. |
327 | Graph Neural Network for Tag Ranking in Tag-enhanced Video Recommendation | Qi Liu; Ruobing Xie; Lei Chen; Shukai Liu; Ke Tu; Peng Cui; Bo Zhang; Leyu Lin; | In this paper, we propose a novel Graph neural network based tag ranking (GraphTR) framework on a huge heterogeneous network with video, tag, user and media. |
328 | Decoupled Graph Convolution Network for Inferring Substitutable and Complementary Items | Yiding Liu; Yulong Gu; Zhuoye Ding; Junchao Gao; Ziyi Guo; Yongjun Bao; Weipeng Yan; | To fill this gap, we propose a novel solution, namely Decoupled Graph Convolutional Network (DecGCN), to solve the problem of inferring substitutable and complementary items. |
329 | Two-Stage Audience Expansion for Financial Targeting in Marketing | Zhining Liu; Xiao-Fan Niu; Chenyi Zhuang; Yize Tan; Yixiang Mu; Jinjie Gu; Guannan Zhang; | Therefore, we propose a method to incorporate biased feedbacks with seeds in a meta-learning manner to pan for golden seeds from the noisy seed-set. |
330 | Efficiently Training Intelligible Models for Global Explanations | Yin Lou; Yongliang Wang; Shiwei Liang; Yang Dong; | In this work, we aim to improve the training efficiency of GAM. |
331 | TwinBERT: Distilling Knowledge to Twin-Structured Compressed BERT Models for Large-Scale Retrieval | Wenhao Lu; Jian Jiao; Ruofei Zhang; | To address the problem, we present TwinBERT model, which has two improvements: 1) represent query and document separately using twin-structured encoders and 2) each encoder is a highly compressed BERT-like model with less than one third of the parameters. |
332 | Learning to Create Better Ads: Generation and Ranking Approaches for Ad Creative Refinement | Shaunak Mishra; Manisha Verma; Yichao Zhou; Kapil Thadani; Wei Wang; | In particular, given an input ad creative, we study approaches to refine the given ad text and image by: (i) generating new ad text, (ii) recommending keyphrases for new ad text, and (iii) recommending image tags (objects in the image) to select new ad image. |
333 | Personalizing Natural Language Understanding using Multi-armed Bandits and Implicit Feedback | Fabian Moerchen; Patrick Ernst; Giovanni Zappella; | In this work, we describe how we leveraged multi-armed bandits in combination with implicit customer feedback to improve accuracy and personalization of responses to voice request in the music domain. |
334 | MiNet: Mixed Interest Network for Cross-Domain Click-Through Rate Prediction | Wentao Ouyang; Xiuwu Zhang; Lei Zhao; Jinmei Luo; Yu Zhang; Heng Zou; Zhaojie Liu; Yanlong Du; | In this paper, we address this problem and leverage auxiliary data from a source domain to improve the CTR prediction performance of a target domain. |
335 | Learning to Infer User Hidden States for Online Sequential Advertising | Zhaoqing Peng; Junqi Jin; Lan Luo; Yaodong Yang; Rui Luo; Jun Wang; Weinan Zhang; Haiyang Xu; Miao Xu; Chuan Yu; Tiejian Luo; Han Li; Jian Xu; Kun Gai; | In this paper, we model this intention as a latent variable and formulate the problem as a Partially Observable Markov Decision Process (POMDP) where the underlying intents are inferred based on the observable behaviors. |
336 | Search-based User Interest Modeling with Lifelong Sequential Behavior Data for Click-Through Rate Prediction | Qi Pi; Guorui Zhou; Yujing Zhang; Zhe Wang; Lejian Ren; Ying Fan; Xiaoqiang Zhu; Kun Gai; | In this paper, we tackle this problem by designing a new modeling paradigm, which we name as Search-based Interest Model (SIM). |
337 | Category-aware Graph Neural Networks for Improving E-commerce Review Helpfulness Prediction | Xiaoru Qu; Zhao Li; Jialin Wang; Zhipeng Zhang; Pengcheng Zou; Junxiao Jiang; Jiaming Huang; Rong Xiao; Ji Zhang; Jun Gao; | To handle these two challenges, we propose CA-GNN (Category Aware Graph Neural Networks), which uses graph neural networks (GNNs) to identify helpful reviews in a multi-task manner — we employ GNNs with one shared and many item-specific graph convolutions to learn the common features and each item’s specific criterion for classifying reviews simultaneously. |
338 | Expert-in-the-loop AI for Polymer Discovery | Petar Ristoski; Dmitry Yu Zubarev; Anna Lisa Gentile; Nathaniel Park; Daniel Sanders; Daniel Gruhl; Linda Kato; Steve Welch; | We propose a methodology to quickly capture the intent and expertise of a domain expert in order to train personalized AI models for specific tasks. |
339 | An Extensive Investigation of Machine Learning Techniques for Sleep Apnea Screening | Jose F. Rodrigues; Jean-Louis Pepin; Lorraine Goeuriot; Sihem Amer-Yahia; | In this work, we extensively investigate the use of Machine Learning techniques in the task of determining which factors are more revealing with respect to OSA along with a discussion of the challenges to perform such a task. |
340 | Continuous Improvement of Medical Diagnostic Systems with Large Scale Patient Vignette Simulation | Suhrid Satyal; Nick Fletcher; Shameek Ghosh; | Here, we address this challenge using a novel patient vignette simulation algorithm within an iterative clinician-in-the-loop methodology for semi-automatically evaluating and deploying medical diagnostic systems in production. |
341 | Detection of Novel Social Bots by Ensembles of Specialized Classifiers | Mohsen Sayyadiharikandeh; Onur Varol; Kai-Cheng Yang; Alessandro Flammini; Filippo Menczer; | To address these issues, we propose a new supervised learning method that trains classifiers specialized for each class of bots and combines their decisions through the maximum rule. |
342 | ITAD: Integrative Tensor-based Anomaly Detection System for Reducing False Positives of Satellite Systems | Youjin Shin; Sangyup Lee; Shahroz Tariq; Myeong Shin Lee; Okchul Jung; Daewon Chung; Simon S. Woo; | In this work, we introduce an Integrative Tensor-based Anomaly Detection (ITAD) framework to detect anomalies in a satellite system with the goal of minimizing false positives. |
343 | Helix: DGA Domain Embeddings for Tracking and Exploring Botnets | Lior Sidi; Yisroel Mirsky; Asaf Nadler; Yuval Elovici; Asaf Shabtai; | In this paper, we present Helix, a method for tracking and exploring botnets. |
344 | Crime Linkage Based on Textual Hebrew Police Reports Utilizing Behavioral Patterns | Adir Solomon; Amit Magen; Simo Hanouna; Mor Kertis; Bracha Shapira; Lior Rokach; | In this study, we propose an automatic and language independent method for extracting behavioral patterns from police reports. |
345 | AGATHA: Automatic Graph Mining And Transformer based Hypothesis Generation Approach | Justin Sybrandt; Ilya Tyagin; Michael Shtutman; Ilya Safro; | We present AGATHA, a deep-learning hypothesis generation system that learns a data-driven ranking criteria to recommend new biomedical connections. |
346 | Query Understanding for Surfacing Under-served Music Content | Federico Tomasi; Rishabh Mehrotra; Aasish Pappu; Judith Bütepage; Brian Brost; Hugo Galvão; Mounia Lalmas; | We propose a framework to develop query understanding techniques to identify potential non-focused search queries on a music streaming platform, where users’ information needs are non-specific enough to expose under-served content without severely impacting user satisfaction. |
347 | LiFT: A Scalable Framework for Measuring Fairness in ML Applications | Sriram Vasudevan; Krishnaram Kenthapadi; | Motivated by the need to understand and address algorithmic bias in web-scale ML systems and the limitations of existing fairness toolkits, we present the LinkedIn Fairness Toolkit (LiFT), a framework for scalable computation of fairness metrics as part of large ML systems. |
348 | Match Tracing: A Unified Framework for Real-time Win Prediction and Quantifiable Performance Evaluation | Kai Wang; Hao Li; Linxia Gong; Jianrong Tao; Runze Wu; Changjie Fan; Liang Chen; Peng Cui; | To this end, we develop a unified Match Tracing framework (namely, MT), for tackling the win prediction and performance evaluation jointly. |
349 | Masked-field Pre-training for User Intent Prediction | Peng Wang; Jiang Xu; Chunyi Liu; Hao Feng; Zang Li; Jieping Ye; | To overcome data sparsity, we propose a masked-field pre-training framework. |
350 | Efficient Neural Query Auto Completion | Sida Wang; Weiwei Guo; Huiji Gao; Bo Long; | In this paper, we propose an efficient neural QAC system with effective context modeling to overcome these challenges. |
351 | A Joint Inverse Reinforcement Learning and Deep Learning Model for Drivers’ Behavioral Prediction | Guojun Wu; Yanhua Li; Shikai Luo; Ge Song; Qichao Wang; Jing He; Jieping Ye; Xiaohu Qie; Hongtu Zhu; | This paper aims to develop a joint framework of combining inverse reinforcement learning (IRL) with deep learning (DL) regression model, called IRL-DL, to predict drivers’ future behavior in ride-hailing platforms. |
352 | Deep Behavior Tracing with Multi-level Temporality Preserved Embedding | Runze Wu; Hao Deng; Jianrong Tao; Changjie Fan; Qi Liu; Liang Chen; | In this paper, we propose an intuitive and effective embedding method called Multi-level Aligned Temporal Embedding (MATE), which can tackle the temporal irregularity of recent behavior sequence and then align with the long-term periodicity in the activity cycle. |
353 | Zero-Shot Heterogeneous Transfer Learning from Recommender Systems to Cold-Start Search Retrieval | Tao Wu; Ellie Ka-In Chio; Heng-Tze Cheng; Yu Du; Steffen Rendle; Dima Kuzmin; Ritesh Agarwal; Li Zhang; John Anderson; Sarvjeet Singh; Tushar Chandra; Ed H. Chi; Wen Li; Ankit Kumar; Xiang Ma; Alex Soares; Nitin Jindal; Pei Cao; | In this paper, we propose a new Zero-Shot Heterogeneous Transfer Learning framework that transfers learned knowledge from the recommender system component to improve the search component of a content platform. |
354 | Relevance Ranking for Real-Time Tweet Search | Yan Xia; Yu Sun; Tian Wang; Juan Caicedo Carvajal; Jinliang Fan; Bhargav Mangipudi; Lisa Huang; Yatharth Saraf; | Considering the above properties and constraints, we present a relevance ranking system for Tweet search addressing all these challenges at Twitter. |
355 | Generating Full Spatiotemporal Vehicular Paths: A Data Fusion Approach | Nan Xiao; Nan Hu; Liang Yu; Cheng Long; | In this paper, we leverage two types of large traffic datasets – point flows and sample trajectories – to generate the full city-scale vehicular paths. |
356 | Multi-Channel Sellers Traffic Allocation in Large-scale E-commerce Promotion | Shen Xin; Yizhou Ye; Martin Ester; Cheng Long; Jie Zhang; Zhao Li; Kaiying Yuan; Yanghua Li; | To address these problems, we design a Multi-Channel Sellers Traffic Allocation (MCSTA) optimization model to obtain optimal page view (PV) distribution concerning global GMV. |
357 | aDMSCN: A Novel Perspective for User Intent Prediction in Customer Service Bots | Kuan Xu; Chilin Fu; Xiaolu Zhang; Cen Chen; Ya-Lin Zhang; Wenge Rong; Zujie Wen; Jun Zhou; Xiaolong Li; Yu Qiao; | To address these two problems, we propose an attention-based Deep Multi-instance Sequential Cross Network (aDMSCN) to deal with the UIP task. |
358 | GraphSAIL: Graph Structure Aware Incremental Learning for Recommender Systems | Yishi Xu; Yingxue Zhang; Wei Guo; Huifeng Guo; Ruiming Tang; Mark Coates; | In this work, we propose to update GNN-based recommender models incrementally so that the computation time can be greatly reduced and models can be updated more frequently. |
359 | Ranking User Attributes for Fast Candidate Selection in Recommendation Systems | Huichao Xue; | We propose a model that forecasts the relevance of documents matched by each individual attribute. |
360 | Learning to Build User-tag Profile in Recommendation System | Su Yan; Xin Chen; Ran Huo; Xu Zhang; Leyu Lin; | In this paper, we propose a user tag profiling model (UTPM) to study user-tag profiling as a multi-label classification task using deep neural networks. |
361 | You Are How You Use: Catching Gas Theft Suspects among Diverse Restaurant Users | Xiaodu Yang; Xiuwen Yi; Shun Chen; Sijie Ruan; Junbo Zhang; Yu Zheng; Tianrui Li; | In this paper, we propose a gas-theft detection method msRank to discover suspicious restaurant users when only scarce labels are available. |
362 | Query-aware Tip Generation for Vertical Search | Yang Yang; Junmei Hao; Canjia Li; Zili Wang; Jingang Wang; Fuzheng Zhang; Rao Fu; Peixu Hou; Gong Zhang; Zhongyuan Wang; | To address this issue, this paper proposes a query-aware tip generation framework, integrating query information into encoding and subsequent decoding processes. |
363 | BotSpot: A Hybrid Learning Framework to Uncover Bot Install Fraud in Mobile Advertising | Tianjun Yao; Qing Li; Shangsong Liang; Yadong Zhu; | In this work, we propose an anti-fraud method based on heterogeneous graph that incorporates both local context and global context via graph neural networks (GNN) and gradient boosting classifier to detect bot fraud installs at Mobvista, a leading global mobile advertising company. |
364 | Community Mitigation: A Data-driven System for COVID-19 Risk Assessment in a Hierarchical Manner | Yanfang Ye; Yujie Fan; Shifu Hou; Yiming Zhang; Yiyue Qian; Shiyu Sun; Qian Peng; Mingxuan Ju; Wei Song; Kenneth Loparo; | To slow the spread of virus infections and better respond with actionable strategies for community mitigation, leveraging the large-scale and real-time pandemic related data generated from heterogeneous sources (e.g., disease related data, demographic data, mobility data, and social media data), in this work, we propose and develop a data-driven system (named α-satellite), as an initial offering, to provide real-time COVID-19 risk assessment in a hierarchical manner in the United States. |
365 | Who is Delivering My Food?: Detecting Food Delivery Abusers using Variational Reward Inference Networks | DaeYoung Yoon; Simon S. Woo; | In this work, we aim to detect food delivery abusers using unauthorized vehicles, by formulating this problem as a novelty detection over sequential data. |
366 | Elevated Road Network: A Metric Learning Method for Recognizing Whether a Vehicle is on an Elevated Road | Xiaobing Zhang; Hailiang Xu; Jian Yang; Jia Sun; Fan Chen; Leiyun Li; | We propose Elevated Road Network (ERNet), a lightweight and real industrial neural network model for mobile navigation, to solve elevated road recognition fundamentally. |
367 | Predicting Quality of Automated Welding with Machine Learning and Semantics: A Bosch Case Study | Baifan Zhou; Yulia Svetashova; Seongsu Byeon; Tim Pychynski; Ralf Mikut; Evgeny Kharlamov; | In this paper we develop an ML pipeline to predict the spot quality before the actual welding happens. |
368 | Ensembled CTR Prediction via Knowledge Distillation | Jieming Zhu; Jinyang Liu; Weiqi Li; Jincai Lai; Xiuqiang He; Liang Chen; Zibin Zheng; | Instead, our work targets at a new model training strategy based on knowledge distillation (KD). KD is a teacher-student learning framework to transfer knowledge learned from a teacher model to a student model. |
369 | GeoLink Cruises: A Non-Synthetic Benchmark for Co-Reference Resolution on Knowledge Graphs | Reihaneh Amini; Lu Zhou; Pascal Hitzler; | In this paper, we propose the use of the Cruise entities in the GeoLink data repository as a real-world instance matching benchmark for linked data and knowledge graphs. |
370 | MLM: A Benchmark Dataset for Multitask Learning with Multiple Languages and Modalities | Jason Armitage; Endri Kacupaj; Golsa Tahmasebzadeh; Swati; Maria Maleshkova; Ralph Ewerth; Jens Lehmann; | In this paper, we introduce the MLM (Multiple Languages and Modalities) dataset – a new resource to train and evaluate multitask systems on samples in multiple modalities and three languages. |
371 | MindReader: Recommendation over Knowledge Graph Entities with Explicit User Ratings | Anders H. Brams; Anders L. Jakobsen; Theis E. Jendal; Matteo Lissandrini; Peter Dolog; Katja Hose; | To overcome this limitation, we introduce a new dataset, called the MindReader dataset, providing explicit user ratings both for items and for KG entities. |
372 | ORCAS: 20 Million Clicked Query-Document Pairs for Analyzing Search | Nick Craswell; Daniel Campos; Bhaskar Mitra; Emine Yilmaz; Bodo Billerbeck; | This paper describes a click data release related to the TREC Deep Learning Track document corpus. |
373 | TweetsCOV19 – A Knowledge Base of Semantically Annotated Tweets about the COVID-19 Pandemic | Dimitar Dimitrov; Erdal Baran; Pavlos Fafalios; Ran Yu; Xiaofei Zhu; Matthäus Zloch; Stefan Dietze; | In this paper, we describe TweetsCOV19, a publicly available knowledge base of currently more than 8 million tweets, spanning October 2019 – April 2020. |
374 | LensKit for Python: Next-Generation Software for Recommender Systems Experiments | Michael D. Ekstrand; | In this paper, I present the next generation of the LensKit project, re-envisioning the original tool’s objectives as flexible Python package for supporting recommender systems research and development. |
375 | A Multidimensional Dataset Based on Crowdsourcing for Analyzing and Detecting News Bias | Michael Färber; Victoria Burkard; Adam Jatowt; Sora Lim; | In this paper, we firstly aggregate the aspects of news bias in related works by proposing a new annotation schema for labeling news bias. |
376 | Feature Extraction for Large-Scale Text Collections | Luke Gallagher; Antonio Mallia; J. Shane Culpepper; Torsten Suel; B. Barla Cambazoglu; | In this paper, we introduce Fxt, an open-source framework to perform efficient and scalable feature extraction. |
377 | CauseNet: Towards a Causality Graph Extracted from the Web | Stefan Heindorf; Yan Scholten; Henning Wachsmuth; Axel-Cyrille Ngonga Ngomo; Martin Potthast; | Notwithstanding this challenge, we compile CauseNet, a large-scale knowledge base of claimed causal relations between causal concepts. |
378 | Fine-Grained Relevance Annotations for Multi-Task Document Ranking and Question Answering | Sebastian Hofstätter; Markus Zlabinger; Mete Sertkan; Michael Schröder; Allan Hanbury; | In this work, we present FiRA: a novel dataset of Fine-Grained Relevance Annotations. |
379 | SDM-RDFizer: An RML Interpreter for the Efficient Creation of RDF Knowledge Graphs | Enrique Iglesias; Samaneh Jozashoori; David Chaves-Fraga; Diego Collarana; Maria-Esther Vidal; | In this paper, we propose the SDM-RDFizer, an interpreter of the RDF Mapping Language (RML), to transform raw data in various formats into an RDF knowledge graph. |
380 | Web Page Segmentation Revisited: Evaluation Framework and Dataset | Johannes Kiesel; Florian Kneist; Lars Meyer; Kristof Komlossy; Benno Stein; Martin Potthast; | To address this shortcoming, we contribute two resources: (1) An evaluation framework which can be adjusted to downstream tasks by measuring the segmentation similarity regarding visual, structural, and textual elements, and which includes measures for annotator agreement, segmentation quality, and an algorithm for segmentation fusion. (2) The Webis-WebSeg-20 dataset, comprising 42,450~crowdsourced segmentations for 8,490~web pages, outranging existing sources by an order of magnitude. |
381 | The Newspaper Navigator Dataset: Extracting Headlines and Visual Content from 16 Million Historic Newspaper Pages in Chronicling America | Benjamin Charles Germain Lee; Jaime Mears; Eileen Jakeway; Meghan Ferriter; Chris Adams; Nathan Yarasavage; Deborah Thomas; Kate Zwaard; Daniel S. Weld; | We describe our pipeline that utilizes this deep learning model to extract 7 classes of visual content: headlines, photographs, illustrations, maps, comics, editorial cartoons, and advertisements, complete with textual content such as captions derived from the METS/ALTO OCR, as well as image embeddings. |
382 | MAEC: A Multimodal Aligned Earnings Conference Call Dataset for Financial Risk Prediction | Jiazheng Li; Linyi Yang; Barry Smyth; Ruihai Dong; | We introduce a new, large-scale multi-modal, text-audio paired, earnings-call dataset named MAEC, based on S&P 1500 companies. |
383 | Argo Lite: Open-Source Interactive Graph Exploration and Visualization in Browsers | Siwei Li; Zhiyan Zhou; Anish Upadhayay; Omar Shaikh; Scott Freitas; Haekyu Park; Zijie J. Wang; Susanta Routray; Matthew Hull; Duen Horng Chau; | To address these issues, we have developed Argo Lite, a new in-browser interactive graph exploration and visualization tool. |
384 | CC-News-En: A Large English News Corpus | Joel Mackenzie; Rodger Benham; Matthias Petri; Johanne R. Trippas; J. Shane Culpepper; Alistair Moffat; | We describe a static, open-access news corpus using data from the Common Crawl Foundation, who provide free, publicly available web archives, including a continuous crawl of international news articles published in multiple languages. |
385 | PrivacyFL: A Simulator for Privacy-Preserving and Secure Federated Learning | Vaikkunth Mugunthan; Anton Peraire-Bueno; Lalana Kagal; | In this paper, we introduce PrivacyFL, which is an extensible, easily configurable, and scalable simulator for federated learning environments. |
386 | ContentWise Impressions: An Industrial Dataset with Impressions Included | Fernando B. Pérez Maurera; Maurizio Ferrari Dacrema; Lorenzo Saule; Mario Scriminaci; Paolo Cremonesi; | In this article, we introduce the \dataset dataset, a collection of implicit interactions and impressions of movies and TV series from an Over-The-Top media service, which delivers its media contents over the Internet. |
387 | Profiling Entity Matching Benchmark Tasks | Anna Primpeli; Christian Bizer; | This resource paper systematically complements, profiles, and compares 21 entity matching benchmark tasks. |
388 | A Large Test Collection for Entity Aspect Linking | Jordan Ramsdell; Laura Dietz; | Complementing efforts of Nanni et al (2018), we provide a large-scale test collection which is derived from Wikipedia hyperlinks in a dump from 01/01/2020. |
389 | A Dataset of Journalists’ Interactions with Their Readership: When Should Article Authors Reply to Reader Comments? | Julian Risch; Ralf Krestel; | To this end, we present a dataset of dialogs in which journalists of The Guardian replied to reader comments and identify the reasons why. |
390 | Karate Club: An API Oriented Open-Source Python Framework for Unsupervised Learning on Graphs | Benedek Rozemberczki; Oliver Kiss; Rik Sarkar; | We present Karate Club – a Python framework combining more than 30 state-of-the-art graph mining algorithms. |
391 | Little Ball of Fur: A Python Library for Graph Sampling | Benedek Rozemberczki; Oliver Kiss; Rik Sarkar; | In this paper, we describe Little Ball of Fur a Python library that includes more than twenty graph sampling algorithms. |
392 | Falcon 2.0: An Entity and Relation Linking Tool over Wikidata | Ahmad Sakor; Kuldeep Singh; Anery Patel; Maria-Esther Vidal; | In this paper, we present Falcon 2.0, the first joint entity and relation linking tool over Wikidata. |
393 | GeoFlink: A Distributed and Scalable Framework for the Real-time Processing of Spatial Streams | Salman Ahmed Shaikh; Komal Mariam; Hiroyuki Kitagawa; Kyoung-Sook Kim; | To fill this gap, we present GeoFlink, which extends Apache Flink to support spatial data types, indexes and continuous queries over spatial data streams. |
394 | Event-QA: A Dataset for Event-Centric Question Answering over Knowledge Graphs | Tarcísio Souza Costa; Simon Gottschalk; Elena Demidova; | In this paper, we present the Event-QA dataset for answering event-centric questions over knowledge graphs. |
395 | ReQue: A Configurable Workflow and Dataset Collection for Query Refinement | Mahtab Tamannaee; Hossein Fani; Fattane Zarrinkalam; Jamil Samouh; Samad Paydar; Ebrahim Bagheri; | In this paper, we implement and publicly share a configurable software workflow and a collection of gold standard datasets for training and evaluating supervised query refinement methods. |
396 | BioKG: A Knowledge Graph for Relational Learning On Biological Data | Brian Walsh; Sameh K. Mohamed; Vít Nováček; | To make relational learning on biomedical data more standardised and reproducible, we propose a new biological knowledge graph which provides a compilation of curated relational data from open biological databases in a unified format with common, interlinked identifiers. |
397 | Flexible IR Pipelines with Capreolus | Andrew Yates; Kevin Martin Jose; Xinyu Zhang; Jimmy Lin; | In order to improve the flexibility of such toolkits, we propose implementing experimental pipelines as dependency graphs of functional "IR primitives,” which we call modules, that can be used and combined as needed. |
398 | MIMICS: A Large-Scale Data Collection for Search Clarification | Hamed Zamani; Gord Lueck; Everest Chen; Rodolfo Quispe; Flint Luu; Nick Craswell; | In this paper, we introduce MIMICS, a collection of search clarification datasets for real web search queries sampled from the Bing query logs. |
399 | The Enslaved Dataset: A Real-world Complex Ontology Alignment Benchmark using Wikibase | Lu Zhou; Cogan Shimizu; Pascal Hitzler; Alicia M. Sheill; Seila Gonzalez Estrecha; Catherine Foley; Duncan Tarr; Dean Rehberger; | In this paper, we propose a real-world dataset from the Enslaved project as a potential complex alignment benchmark. |
400 | ReCOVery: A Multimodal Repository for COVID-19 News Credibility Research | Xinyi Zhou; Apurva Mulay; Emilio Ferrara; Reza Zafarani; | In this work, we present ReCOVery, a repository designed and constructed to facilitate research on combating such information regarding COVID-19. |