CIKM '14- Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management

Full Citation in the ACM Digital Library

SESSION: DB Session 1 - Query Processing

Rubato DB: A Highly Scalable Staged Grid Database System for OLTP and Big Data Applications

MaC: A Probabilistic Framework for Query Answering with Machine-Crowd Collaboration

Templated Search over Relational Databases

ExpressQ: Identifying Keyword Context and Search Target in Relational Keyword Queries

Pulling Conjunctive Query Equivalence out of the Bag

SESSION: IR Session 1 - IR Evaluation

Machine-Assisted Search Preference Evaluation

Designing Test Collections for Comparing Many Systems

Multileaved Comparisons for Fast Online Evaluation

A Retrievability Analysis: Exploring the Relationship Between Retrieval Bias and Retrieval Performance

Relevance and Effort: An Analysis of Document Utility

SESSION: R Session 2 - Models

A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval

A Comparison of Retrieval Models using Term Dependencies

Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation

A Fixed-Point Method for Weighting Terms in Verbose Informational Queries

Term Selection and Result Reranking for Question Retrieval by Exploiting Hierarchical Classification

SESSION: KM Session 1: Social Networks & Social Media I

Analysis on Community Variational Trend in Dynamic Networks

Learning Interactions for Social Prediction in Large-scale Networks

Influence Maximization over Large-Scale Social Networks: A Bounded Linear Approach

Predictability of Distrust with Interaction Data

Optimizing Multi-Relational Factorization Models for Multiple Target Relations

SESSION: KM Session 2 - Classification I

Learning to Propagate Rare Labels

A Mixtures-of-Trees Framework for Multi-Label Classification

Solving Linear SVMs with Multiple 1D Projections

Adding Robustness to Support Vector Machines Against Adversarial Reverse Engineering

Active Learning based Survival Regression for Censored Data

SESSION: KM Session 3 - Recommenders & Collaborative Filtering I

Collaborative Filtering Incorporating Review Text and Co-clusters of Hidden User Communities and Item Groups

Leveraging Social Connections to Improve Personalized Ranking for Collaborative Filtering

Deviation-Based Contextual SLIM Recommenders

User Interests Imbalance Exploration in Social Recommendation: A Fitness Adaptation

CARS2: Learning Context-aware Representations for Context-aware Recommendations

SESSION: IR Session 3 - Linguistics

Incremental Update Summarization: Adaptive Sentence Selection based on Prevalence and Novelty

A Dynamic Reconstruction Approach to Topic Summarization of User-Generated-Content

Using Crowdsourcing to Investigate Perception of Narrative Similarity

Correct Me If I'm Wrong: Fixing Grammatical Errors by Preposition Ranking

SESSION: IR Session 4 - Community QA & Social Search

Mining Semi-Structured Online Knowledge Bases to Answer Natural Language Questions on Community QA Websites

Improving Term Weighting for Community Question Answering Search Using Syntactic Analysis

Social Book Search Reranking with Generalized Content-Based Filtering

Question Retrieval with High Quality Answers in Community Question Answering

SESSION: KM Session 4 - Social Networks & Social Media II

Controllable Information Sharing for User Accounts Linkage across Multiple Online Social Networks

Identifying Your Customers in Social Networks

Learning a Linear Influence Model from Transient Opinion Dynamics

Modeling Paying Behavior in Game Social Networks

SESSION: KM Session 5 - Classification II

Enabling Precision/Recall Preferences for Semi-supervised SVM Training

A Cross-modal Multi-task Learning Framework for Image Annotation

Multi-task Multi-view Learning for Heterogeneous Tasks

Multi-task Sparse Structure Learning

SESSION: KM Session 6 - Recommenders & Collaborative Filtering I

Truth Discovery in Crowdsourced Detection of Spatial Events

Maximizing Multi-scale Spatial Statistical Discrepancy

Mining and Planning Time-aware Routes from Check-in Data

High Impact Academic Paper Prediction Using Temporal and Topological Features

SESSION: DB Session 2 - Knowledge Base & Data Semantics

Robust Entity Linking via Random Walks

SemStore: A Semantic-Preserving Distributed RDF Triple Store

Pattern Match Query in a Large Uncertain Graph

Semantic Approximate Keyword Query Based on Keyword and Query Coupling Relationship Analysis

SESSION: IR Session 5 - Users

The Effects of Vertical Rank and Border on Aggregated Search Coherence and Search Behavior

An Eye-tracking Study of User Interactions with Query Auto Completion

Improving Tail Query Performance by Fusion Model

Predicting Search Task Difficulty at Different Search Stages

Re-call and Re-cognition in Episode Re-retrieval: A User Study on News Re-finding a Fortnight Later

SESSION: IR Session 6 - Query Intent

Online Exploration for Detecting Shifts in Fresh Intent

Effect of Intent Descriptions on Retrieval Evaluation

Search Result Diversification via Filling Up Multiple Knapsacks

Query Augmentation based Intent Matching in Retail Vertical Ads

SESSION: KM Session 7 - Social Networks & Social Media III

Sketch-based Influence Maximization and Computation: Scaling up with Guarantees

Active Exploration in Networks: Using Probabilistic Relationships for Learning and Inference

Modeling Topic Diffusion in Multi-Relational Bibliographic Information Networks

Graph-based Point-of-interest Recommendation with Geographical and Temporal Influences

On Building Decision Trees from Large-scale Data in Applications of On-line Advertising

SESSION: KM Session 8 - Clustering and Ranking

Improving Co-Cluster Quality with Application to Product Recommendations

Focusing Decomposition Accuracy by Personalizing Tensor Decomposition (PTD)

Ranking-based Clustering on General Heterogeneous Information Networks by Network Projection

NCR: A Scalable Network-Based Approach to Co-Ranking in Question-and-Answer Sites

Similarity Search using Concept Graphs

SESSION: KM Session 9 - Recommenders & Collaborative Filtering II

"Strength Lies in Differences": Diversifying Friends for Recommendations through Subspace Clustering

Exploiting Geographical Neighborhood Characteristics for Location Recommendation

Increasing the Responsiveness of Recommended Expert Collaborators for Online Open Projects

Dual-Regularized One-Class Collaborative Filtering

HGMF: Hierarchical Group Matrix Factorization for Collaborative Recommendation

SESSION: DB Session 3 - Social and Graph Data

SocialTransfer: Transferring Social Knowledge for Cold-Start Cowdsourcing

CAST: A Context-Aware Story-Teller for Streaming Social Content

Distributed Graph Summarization

Efficient Probabilistic Supergraph Search Over Large Uncertain Graphs

SESSION: IR Session 7: Exploratory Search

Narrow or Broad?: Estimating Subjective Specificity in Exploratory Search

Supporting Complex Search Tasks

Extending Faceted Search to the General Web

From Skimming to Reading: A Two-stage Examination Model for Web Search

SESSION: KM Session 10: Text Data Mining I

Generative Modeling of Entity Comparisons in Text

How Many Folders Do You Really Need?: Classifying Email into a Handful of Categories

Latent Aspect Mining via Exploring Sparsity and Intrinsic Information

Recognizing Humor on Twitter

SESSION: KM Session 11: Knowledge Representation & Reasoning I

Towards Consistency Checking over Evolving Ontologies

A Practical Fine-grained Approach to Resolving Incoherent OWL 2 DL Terminologies

Domain Cartridge: Unsupervised Framework for Shallow Domain Ontology Construction from Corpus

Faceted Search over Ontology-Enhanced RDF Data

SESSION: DB Session 4 - Data Integration and Big Data

DFD: Efficient Functional Dependency Discovery

Estimating the Number and Sizes of Fuzzy-Duplicate Clusters

Efficient Static and Dynamic In-Database Tensor Decompositions on Chunk-Based Array Stores

Efficient Filter Approximation Using the Earth Mover's Distance in Very Large Multimedia Databases with Feature Signatures

SESSION: IR Session 8: Social Media

Time-Aware Rank Aggregation for Microblog Search

Tagging Your Tweets: A Probabilistic Modeling of Hashtag Annotation in Twitter

People Search within an Online Social Network: Large Scale Analysis of Facebook Graph Search Query Logs

Automatic Social Circle Detection Using Multi-View Clustering

SESSION: IR Session 9: Machine Learning

Semantic Compositionality in Tree Kernels

Focused Crawling for Structured Data

Ranking Optimization with Constraints

Supervised Nested PageRank

SESSION: KM Session 12: Text Data Mining II

Concept-based Short Text Classification and Ranking

EgoCentric: Ego Networks for Knowledge-based Short Text Classification

A Cross-Lingual Joint Aspect/Sentiment Model for Sentiment Analysis

Microblog Topic Contagiousness Measurement and Emerging Outbreak Monitoring

SESSION: KM Session 13: Mining Data Streams

Fast, Accurate, and Space-efficient Tracking of Time-weighted Frequent Items from Data Streams

GI-NMF: Group Incremental Non-Negative Matrix Factorization on Data Streams

Active Learning for Streaming Networked Data

Online User Location Inference Exploiting Spatiotemporal Correlations in Social Streams

SESSION: KM Session 14: Data Mining Theory & Methods

Robust Principal Component Analysis with Missing Data

Model Selection with the Covering Number of the Ball of RKHS

A Flexible Framework for Projecting Heterogeneous Data

Fair Allocation in Online Markets

Understanding the Sparsity: Augmented Matrix Factorization with Sampled Constraints on Unobservables

SESSION: KM Session 15: Knowledge Representation & Reasoning II

Structure Learning via Parameter Learning

Scalable Distributed Belief Propagation with Prioritized Block Updates

RC-NET: A General Framework for Incorporating Knowledge into Word Representations

On Independence Atoms and Keys

Rebuilding the Tower of Babel: Towards Cross-System Malware Information Sharing

SESSION: KM Session 16: Large- Scale Machine Learning

Computing Multi-Relational Sufficient Statistics for Large Databases

Distributed Stochastic ADMM for Matrix Factorization

Data/Feature Distributed Stochastic Coordinate Descent for Logistic Regression

Exploring Ensemble of Models in Taxonomy-based Cross-Domain Sentiment Classification

Verifiable UML Artifact-Centric Business Process Models

SESSION: KM Session 17: Web Data Mining

Transfer Understanding from Head Queries to Tail Queries

What a Nasty Day: Exploring Mood-Weather Relationship from Twitter

Twitter Opinion Topic Model: Extracting Product Opinions from Tweets by Leveraging Hashtags and Sentiment Lexicon

Analysis of Physical Activity Propagation in a Health Social Network

Predicting the Popularity of Online Serials with Autoregressive Models

SESSION: KM Session 18: Data Mining Applications & Bioinformatics

Sequential Action Patterns in Collaborative Ontology-Engineering Projects: A Case-Study in the Biomedical Domain

Towards Pathway Variation Identification: Aligning Patient Records with a Care Pathway

PatentDom: Analyzing Patent Relationships on Multi-View Patent Graphs

Exploring Legal Patent Citations for Patent Valuation

Tracking Temporal Dynamics of Purchase Decisions via Hierarchical Time-Rescaling Model

SESSION: DB Session 5 - Systems and Applications

Robust and Skew-resistant Parallel Joins in Shared-Nothing Systems

SharkDB: An In-Memory Column-Oriented Trajectory Storage

Deal or deceit: detecting cheating in distribution channels

An Appliance-Driven Approach to Detection of Corrupted Load Curve Data

SESSION: IR Session 10: Engagement, Social, Crowdsourcing

Understanding Within-Content Engagement through Pattern Analysis of Mouse Gestures

Modelling and Detecting Changes in User Satisfaction

"Picture the scene...";: Visually Summarising Social Media Events

Competitive Game Designs for Improving the Cost Effectiveness of Crowdsourcing

SESSION: IR Session 11: Semantics

Cross-Modality Submodular Dictionary Learning for Information Retrieval

A Word-Scale Probabilistic Latent Variable Model for Detecting Human Values

Searching Locally-Defined Entities

Customized Organization of Social Media Contents using Focused Topic Hierarchy

SESSION: KM Session 19: Graph Data Mining I

Sampling Triples from Restricted Networks using MCMC Strategy

Efficient Subgraph Skyline Search Over Large Graphs

Within-Network Classification Using Radius-Constrained Neighborhood Patterns

Pushing the Envelope in Graph Compression

SESSION: DB Session 6 - Privacy and Streams

PraDa: Privacy-preserving Data-Deduplication-as-a-Service

Aroma: A New Data Protection Method with Differential Privacy and Accurate Query Answering

Fast Heuristics for Near-Optimal Task Allocation in Data Stream Processing over Clusters

Truth Discovery in Data Streams: A Single-Pass Probabilistic Approach

SESSION: IR Session 12: Efficiency

Time-sensitive Personalized Query Auto-Completion

Document Prioritization for Scalable Query Processing

Analytical Performance Modeling for Top-K Query Processing

Compact Auxiliary Dictionaries for Incremental Compression of Large Repositories

SESSION: IR Session 13: Domain, Semistructured, Mobile

Modelling Relevance towards Multiple Inclusion Criteria when Ranking Patients.

Relationship Emergence Prediction in Heterogeneous Networks through Dynamic Frequent Subgraph Mining

Query-Driven Mining of Citation Networks for Patent Citation Retrieval and Recommendation

Cross-Device Search

SESSION: KM Session 20: Entity and Feature Extraction

Canonicalizing Open Knowledge Bases

A Fresh Look on Knowledge Bases: Distilling Named Events from News

Exploring Features for Complicated Objects: Cross-View Feature Selection for Multi-Instance Learning

On Efficient Meta-Level Features for Effective Text Classification

SESSION: KM Session 21: Graph Data Mining II

Scalable Vaccine Distribution in Large Graphs given Uncertain Data

Component Detection in Directed Networks

MapReduce Triangle Enumeration With Guarantees

Hotspot Detection in a Service-Oriented Architecture


Hashcube: A Data Structure for Space- and Query-Efficient Skycube Compression

Distance or Coverage?: Retrieving Knowledge-Rich Documents From Enterprise Text Collections

Indexing Linked Data in a Wireless Broadcast System with 3D Hilbert Space-Filling Curves

Towards Efficient Dissemination of Linked Data in the Internet of Things

Tell Me What You Want and I Will Tell Others Where You Have Been

Forest-Based Dynamic Sorted Neighborhood Indexing for Real-Time Entity Resolution

Travel distance versus navigation complexity: a study on different spatial queries on road networks

Scalable Privacy-Preserving Record Linkage for Multiple Databases

Exploring Tag-Free RFID-Based Passive Localization and Tracking via Learning-Based Probabilistic Approaches


Simple Arabic Stemmer

Phrase Query Optimization on Inverted Indexes

CLIR for Informal Content in Arabic Forum Posts

Head First: Living Labs for Ad-hoc Search Evaluation

Medical Semantic Similarity with a Neural Language Model

Parameter Tuning with User Models: Influencing Aggregate User Behavior in Cluster Based Retrieval Systems

On the Importance of Venue-Dependent Features for Learning to Rank Contextual Suggestions

Modelling Complex Relevance Spaces with Copulas

Identifying Time Intervals of Interest to Queries

Identification of Answer-Seeking Questions in Arabic Microblogs

Size and Source Matter: Understanding Inconsistencies in Test Collection-Based Evaluation

Exploiting Knowledge Structure for Proximity-aware Movie Retrieval Model

Supervised Hashing with Soft Constraints

Probabilistic Classifier Chain Inference via Gibbs Sampling

GPQ: Directly Optimizing Q-measure based on Genetic Programming

Revisiting the Divergence Minimization Feedback Model

Vertical-Aware Click Model-Based Effectiveness Metrics

Query Performance Prediction for Aspect Weighting in Search Result Diversification

Axiomatic Analysis of Cross-Language Information Retrieval

How People Use the Web in Large Indoor Spaces

Succinct Queries for Linking and Tracking News in Social Media

Exploring Shared Subspace and Joint Sparsity for Canonical Correlation Analysis

Query Performance Prediction By Considering Score Magnitude and Variance Together

Log-Bilinear Document Language Model for Ad-hoc Information Retrieval

Sparse Semantic Hashing for Efficient Large Scale Similarity Search

Spatial Verification for Scalable Mobile Image Retrieval

A Generative Model for Generating Relevance Labels from Human Judgments and Click-Logs

Generalized Bias-Variance Evaluation of TREC Participated Systems

Aligning Vertical Collection Relevance with User Intent


Multi-document Hyperedge-based Ranking for Text Summarization

Non-independent Cascade Formation: Temporal and Spatial Effects

What is the Shape of a Cluster?: Structural Comparisons of Document Clusters

Ranking Sentiment Explanations for Review Summarization Using Dual Decomposition

A Meta-reasoner to Rule Them All: Automated Selection of OWL Reasoners Based on Efficiency

Semantic Topology

CONR: A Novel Method for Sentiment Word Identification

Using Local Information to Significantly Improve Classification Performance

Improving Recommendation Accuracy by Combining Trust Communities and Collaborative Filtering

Nonlinear Classification via Linear SVMs and Multi-Task Learning

Dynamic Clustering of Contextual Multi-Armed Bandits

Unsupervised Feature Selection for Multi-View Clustering on Text-Image Web News Data

Enterprise Discussion Analysis

A Problem-Action Relation Extraction Based on Causality Patterns of Clinical Events in Discharge Summaries

Entity Oriented Task Extraction from Query Logs

Modeling Retail Transaction Data for Personalized Shopping Recommendation

Identifying Latent Study Habits by Mining Learner Behavior Patterns in Massive Open Online Courses

Constrained Question Recommendation in MOOCs via Submodularity

Exploit Latent Dirichlet Allocation for One-Class Collaborative Filtering

A Bootstrapping Based Refinement Framework for Mining Opinion Words and Targets

Adaptive Pairwise Preference Learning for Collaborative Recommendation with Implicit Feedbacks


INK: A Cloud-Based System for Efficient Top-k Interval Keyword Search

CoDEM: An Ingenious Tool of Insight into Community Detection in Social Networks

Faceted Exploring for Domain Knowledge over Linked Open Data

Building and Exploring Dynamic Topic Models on the Web

A Demonstration of SearchonTS: An Efficient Pattern Search Framework for Time Series Data

AESTHETICS: Analytics with Strings, Things, and Cats

Accelerometer-based Activity Recognition on Smartphone

Cleanix: A Big Data Cleaning Parfait

Keeping You in the Loop: Enabling Web-based Things Management in the Internet of Things

Anything You Can Do, I Can Do Better: Finding Expert Teams by CrewScout

WiiCluster: a Platform for Wikipedia Infobox Generation

Negative FaceBlurring: A Privacy-by-Design Approach to Visual Lifelogging with Google Glass

TensorDB: In-Database Tensor Manipulation with Tensor-Relational Query Plans

TweetMogaz v2: Identifying News Stories in Social Media

TwinChat: A Twitter and Web User Interactive Chat System


VFDS: An Application to Generate Fast Sample Databases

Knowledge Management for Keyword Search over Data Graphs

Clairvoyant: An Early Prediction System For Video Hits

iMiner: Mining Inventory Data for Intelligent Management

RApID: A System for Real-time Analysis of Information Diffusion in Twitter

RecLand: A Recommender System for Social Networks

MeowsReader: Real-Time Ranking and Filtering of News with Generalized Continuous Top-k Queries

AMiner-mini: A People Search Engine for University

DEESSE: entity-Driven Exploratory and sErendipitous Search SystEm

Manual Annotation of Semi-Structured Documents for Entity-Linking

SmartVenues: Recommending Popular and Personalised Venues in a City

GTE-Rank: Searching for Implicit Temporal Query Results

Exploring Document Collections with Topic Frames

CONDOR: A System for CONstraint DiscOvery and Repair

WORKSHOP SESSION: Workshop Summaries

DTMBIO 2014: International Workshop on Data and Text Mining in Biomedical Informatics

DUBMOD14 - International Workshop on Data-driven User Behavioral Modeling and Mining from Social Media

Seventh Workshop on Exploiting Semantic Annotations in Information Retrieval (ESAIR'14): CIKM 2014 Workshop

LocWeb'14 - 4th International Workshop on Location and the Web: CIKM 2014 Workshop Summary

PIKM 2014: The 7th ACM Workshop for Ph.D. Students in Information and Knowledge Management

PSBD 2014: Overview of the 1st International Workshop on Privacy and Security of Big Data

Web-KR 2014: The 5th International Workshop on Web-scale Knowledge Representation, Retrieval and Reasoning