Assistent Professor at the Web Information Systems group of the Faculty of Engineering, Mathematics and Computer Science (EEMCS/EWI), Delft University of Technology.
My research focus is on developing new human-centered query paradigms for modern information systems and databases. Thus, my research covers problems and challenges from several fields like database query processing, data analytics, data and web mining, natural language processing, and crowd intelligence.
In contrast to system-centric query paradigms like keyword search or SQL, human-centric information systems try to realize a more natural user interaction aligned with human perception and human communication behavior.
My research focuses on several different aspects of this larger challenge, ranging from foundational research in the field of databases to application-oriented research aiming at use the developed technologies to domain specific problems like digital library research, e-commerce, or semantic open courseware.
My research interest is in human-centered information systems, and my research explores novel query paradigms like for example analogy queries or similarity queries. I also worked on several other topics in the context of databases, information systems, and Web science like personalization, information extraction, recommender systems, and crowd-intelligence. The core challenge underlying all my research efforts is on how to model and extract the characterizing semantics of data stored in an information system in such a way that the data can be useful to a given user.
2016: Assistant Professor Delft University of Technology
2014-2016: PostDoc at Technische Universität Braunschweig
2012-2014: PostDoc at National Institute of Informatics, Tokyo, Japan
2011-2012: PostDoc at Technische Universität Braunschweig
2011: Defense of Doctoral Thesis at Technische Universität Braunschweig
2008-2011: Ph.D. Researcher at Technische Universität Braunschweig
2006-2008: Ph.D. Researcher at L3S Research Center, Leibniz University Hannover
2005: Diploma Thesis at Collaborative Software Development Laboratory, University of Hawai’i, Honolulu - Manoa, Certificate issued by University Kaiserslautern
2002-2004: Assistant Researcher at Fraunhofer Institute for Experimental Software Engineering, Kaiserslautern
2000-2004: Student at University Kaiserslautern
Publications prior to 2016
Christoph Lofi and Wolf-tilo Balke. "Large Scale Cooperation Scenarios – Crowdsourcing and its Societal Implication". Transactions on Internet Research (TIR), 12(1):03–14, jan 2016. PDF.
Christoph Lofi. "Towards Human-Centered Database Query Processing using on Perceptual Properties". In Dutch-Belgian DataBase Day (DBDBD). Mons, Belgium, 2016. PDF.
Christoph Lofi, Athiq Ahamed, Pratima Kulkarni, and Ravi Thakkar. "Benchmarking semantic capabilities of analogy querying algorithms". In Int. Conf. on Database Systems for Advanced Applications (DASFAA), volume 9642, 463–478. Dallas, TX, USA, 2016. PDF, DOI.
Nestor Alvaro, Mike Conway, Son Doan, Christoph Lofi, John Overington, and Nigel Collier. "Crowdsourcing Twitter annotations to identify first-hand experiences of prescription drug use". Journal of Biomedical Informatics, 58:280–287, nov 2015. PDF.
Christoph Lofi and Philipp Wille. "Exploiting social judgements in big data analytics". In 13th Lernen Wissen Adaption (LWA) Conference, volume 1458, 444–455. Trier, Germany, oct 2015. PDF.
Philipp Wille, Christoph Lofi, and Wolf-Tilo Balke. "Towards Narrative Information Systems". In Web-Age Information Management (WAIM). Qingdao, Shandong, China, jun 2015. PDF.
Kinda El Maarry. "Crowdsourcing for Query Processing on Web Data: A Case Study on the Skyline Operator". Journal of Computing and Information Technology (CIT), mar 2015. PDF.
Christoph Lofi and Christian Nieke. "I would like to watch something like ‘The Terminator'…” Cooperative Query Personalization Based on Perceptual Similarity". In 18th International Conference on Extending Database Technology (EDBT). Brussels, Belgium, 2015. PDF.
Jiyin He, Kai Kunze, Christoph Lofi, K. Madria Sanjay, and Stephan Sigg. "Towards Mobile Sensor-Aware Crowdsourcing: Architecture, Opportunities and Challenges". In DASFAA Workshop on Uncertain and Crowdsourced Data. Bali, Indonesia, 2014. PDF.
Christoph Lofi and Christian Nieke. "Exploiting Perceptual Similarity: Privacy-Preserving Cooperative Query Personalization". In Int. Conf. on Web Information System Engineering (WISE). Thessaloniki, Greece, 2014. PDF.
C. Lofi, C. Nieke, and N. Collier. "Discriminating Rhetorical Analogies in Social Media". In Conf. of the Europ. Chapter of the Association for Computational Linguistics (EACL). Gothenburg, Sweden, 2014. PDF.
Christoph Lofi and Kinda El Maarry. "Design Patterns for Hybrid Algorithmic-Crowdsourcing Workflows". In 16th IEEE Conf. on Business Informatics (CBI). Geneva, Switzerland, 2014. PDF.
Christoph Lofi, Kinda El Maarry, and Wolf-Tilo Balke. "Skyline Queries over Incomplete Data - Error Models for Focused Crowd-Sourcing". In Int. Conf. on Conceptual Modeling (ER). Hong Kong, China, 2013. PDF.
Christoph Lofi, Kinda El Maarry, and Wolf-Tilo Balke. "Skyline Queries in Crowd-Enabled Databases". In Int. Conf. on Extending Database Technology (EDBT). Genoa, Italy, 2013. PDF.
Christoph Lofi and Christian Nieke. "Modeling Analogies for Human-Centered Information Systems". In 5th Int. Conf. On Social Informatics (SocInfo). Kyoto, Japan, 2013. PDF.
Christoph Lofi. "Just ask a human? – Controlling Quality in Relational Similarity and Analogy Processing using the Crowd". In CDIM Workshop at Database Systems for Business Technology and Web (BTW). Magdeburg, Germany, 2013. PDF.
Christoph Lofi. "Analogy Queries in Information Systems – A New Challenge". Journal of Information & Knowledge Management (JIKM), 2013. PDF.
Christoph Lofi and Ralf Krestel. "iParticipate: Automatic Tweet Generation from Local Government Data". In 17th International Conference on Database Systems for Advanced Applications (DASFAA). Busan, South Korea, 2012. PDF.
Joachim Selke, Christoph Lofi, and Wolf-Tilo Balke. "Pushing the Boundaries of Crowd-Enabled Databases with Query-Driven Schema Expansion". Int. Conf. on Very Large Data Bases (VLDB), 5(2):538–549, 2012. PDF.
Christoph Lofi and Wolf-Tilo Balke. "On Skyline Queries and how to Choose from Pareto Sets". In Advanced Query Processing in Intelligent Systems Reference Library (ISRL 36), chapter 2, pages 15–36. Springer, 2012. PDF.
Christoph Lofi, Wolf-Tilo Balke, and Ulrich Güntzer. "Malleability-Aware Skyline Computation on Linked Open Data". In 17th International Conference on Database Systems for Advanced Applications (DASFAA). Busan, South Korea, 2012. PDF.
Christoph Lofi, Wolf-tilo Balke, and Ulrich Güntzer. "Equivalence Heuristics for Malleability-Aware Skylines". Journal of Computing Science and Engineering (JCSE), 6(3):207–218, 2012. PDF.
Silviu Homoceanu, Michael Loster, Christoph Lofi, and Wolf-tilo Balke. "Will I like it ? – Providing Product Overviews based on Opinion Excerpts". In IEEE Conference on Commerce and Enterprise Computing (CEC). Luxembourg, Luxembourg, 2011. PDF.
Christoph Lofi, Wolf-Tilo Balke, and Ulrich Güntzer. "Eliciting Customer Wishes using Example-Based Heuristics in E-Commerce Applications". In IEEE Conference on Commerce and Enterprise Computing (CEC). Luxembourg, Luxembourg, 2011. PDF.
Christoph Lofi. "Choosing the Right Thing: Cooperative Trade-Off Enhanced Skyline Queries". In PhD Workshop at the 28th International Conference On Data Engineering (ICDE). Hannover, Germany, 2011. PDF.
Christoph Lofi and Wolf-Tilo Balke. "Preference Trade-Offs – Towards Manageable Skylines". In 22. GI-Workshop Grundlagen von Datenbanken (GvD). Bad Helmstedt, Germany, 2010. PDF.
Christoph Lofi, Christian Nieke, and Wolf-Tilo Balke. "Mobile Product Browsing Using Bayesian Retrieval". In IEEE Conf. Commerce and Enterprise Comp. (CEC). Shanghai, China, 2010. PDF.
Joachim Selke, Christoph Lofi, and Wolf-Tilo Balke. "Highly Scalable Multiprocessing Algorithms for Preference-Based Database Retrieval". In 15th International Conference on Database Systems for Advanced Applications (DASFAA). Tsukuba, Japan, 2010. PDF.
Christoph Lofi, Ulrich Güntzer, and Wolf-Tilo Balke. "Efficient Computation of Trade-Off Skylines". In 13th International Conference on Extending Database Technology (EDBT). Lausanne, Switzerland, 2010. PDF.
Christoph Lofi, Wolf-Tilo Balke, and Ulrich Güntzer. "Efficient Skyline Refinement Using Trade-Offs". In 3rd International IEEE Conference on Research Challenges in Information Science (RCIS). Fès, Morocco, 2009. PDF, DOI.
Christoph Lofi, Wolf-Tilo Balke, and Ulrich Güntzer. "Efficient Skyline Refinement Using Trade-Offs Respecting Don't-Care Attributes". International Journal of Computer Science and Applications (IJCSA), 6(5):1–29, 2009. PDF.
Christoph Lofi, W.-T. Wolf-Tilo Balke, Ulrich Güntzer, and U. Guntzer. "Efficiently Performing Consistency Checks for Multi-Dimensional Preference Trade-Offs". In 2nd International IEEE Conference on Research Challenges in Information Science (RCIS), volume 5, 271–278. Marakech, Morocco, 2008. IEEE. PDF, DOI.
Christoph Lofi, Wolf-Tilo Balke, and Ulrich Güntzer. "Consistency Check Algorithms for Multi-Dimensional Preference Trade-Offs". International Journal of Computer Science & Applications (IJCSA), 5(3):165–185, 2008. PDF.
Wolf-Tilo Balke, Christoph Lofi, and Ulrich Güntzer. "User Interaction Support for Incremental Refinement of Preference-Based Queries". In 1st International IEEE Conference on Research Challenges in Information Science (RCIS). Ouarzazate, Morocco, 2007. PDF.
Christoph Lofi and Wolf Siberski. "Service Oriented Architectures for Open E-Learning Systems: An Overview of the Prolix Project". In eTeaching & eScience. Hannover, Germany, 2007. PDF.
Wolf-Tilo Balke, Christoph Lofi, and Ulrich Güntzer. "Incremental Trade-Off Management for Preference Based Queries". International Journal of Computer Science & Applications (IJCSA), 4(2):75–91, 2007. PDF.
Wolf-Tilo Balke, Ulrich Güntzer, and Christoph Lofi. "Eliciting Matters - Controlling Skyline Sizes by Incremental Integration of User Preferences". In 12th International Conference on Database Systems for Advanced Applications (DASFAA). Bangkok, Thailand, 2007. PDF.
Juri L. De Coi, Eelco Herder, Arne Koesling, Christoph Lofi, Daniel Olmedilla, Odysseas Papapetrou, and Wolf Siberski. "A Model for Competence Gap Analysis". In 3rd International Conference on Web Information Systems and Technologies (WEBIST). Barcelona, Spain, 2007. PDF, DOI.
Christoph Lofi. "cGQM - Ein zielorientierter Ansatz für kontinuierliche, automatisierte Messzyklen". In 4th National Conference on Software Measurement and Metrics (DASMA MetriKon 2005). Kaiserslautern, Germany, 2005. PDF.
- Perceptual Properties for Relational Databases
Fields: Database Query Processing, Subspace Clustering, Probabilistic Databases, Sentiment Analysis, Recommender Systems.
Some of the most valuable features of Relational Databases are clearly defined schemas with crisp semantics, thus allowing for rich and complex declarative queries. However, this also comes at a cost: the underlying schema must be carefully designed upfront to support queries expected to fulfill the information need of future users, and the modelling of the structured schema should represent the actual nature and semantics of the represented real world entities in such a way that it naturally aligns with the internalized semantics of user issuing the queries. Here, in some application scenarios, this focus on strict schemas can become problematic. As an example, consider an e-commerce scenario focusing on selling experience products like movies, books, music, or games. Here, the perceived properties describing the user experience those products will entail (which, for most people, is the deciding factor for buying the product) are difficult to capture using relational schemas, which thus often leads to a focus on more objective and crisp properties like production year, actor names, or rough genre labels. Thus, many queries users would naturally ask are not supported by the system, as for example queries for movies which “feel” like a given example movie, or movies which feature a “thought-provoking plot”, movies which are “educational”, or “suitable for children” (we call those queries human-centered queries, as they are the queries most humans would use in a natural conversation with another human, but are often not supported by information systems). One of the challenges around perceived properties of experience products is that it is very hard to foresee during schema design time which properties will be relevant for users, and how they are perceived by them (i.e., the challenge of obtaining values for the properties.) Especially, many of these properties might even be subjective, and thus the perception of different users might differ or be even conflicting (e.g., there might be conflicting views on how “funny” a given movie is).
I claim that most of the perceptual information required to support such human-centered queries can be obtained from user-generated judgements as for example ratings, comments, or reviews. This form of feedback, which can be seen as self-motivated crowdsourcing is a promising source of information as such judgements usually cover the perceptual properties and aspects deemed important by the creator of the judgement. However, integrating this rich source of information into the query process is hard due to the aforementioned challenges, and many applications choose not to try an integration at all: e.g., in most applications (like for example web shops), user reviews are simply displayed for manual consumptions, or user ratings might be used within a recommender systems – but usually it is not possible to access the richness of information contained in human judgements in a declarative and explicit relational fashion.
In this line of work, we are exploring the challenge of supporting such human-centered queries focusing on perceptual properties from a database query processing perspective.
The outlined contributions are as follows:
- Developing a general vision of a database system using perceptual properties, and discuss a high-level model of how to integrate perceptual properties into a suitable data model.
- A special focus will be on consensual perceptual properties to deal with subjectivity in user perception, i.e., properties of entities for which the values emerge form a consensus in perception of a larger user base. Also, we introduce multi-consensual properties for which there is not a single, but multiple consensual values.
- Research into both explicit and latent properties. Here, explicit properties have a real-world interpretation which is explainable to users, while latent properties are opaque but still can be used for several query types like similarity queries.
- Investigating how perceptual properties are represented within a database system. A promising candidate is adapting probabilistic databases, coupled with subspace clustering and exploration to deal with both subjectivity and uncertainty of extraction.
- Developing multiple prototype implementations of systems which can extract, store, and process perceptual properties. Each of these implementations focus on a specific subset of the challenge, e.g., extracting explicit properties, or dealing with multi-consensual values. The long-term goal is to aggregate and combine these individual systems into a larger demonstrator which can be used to showcase the research results.
- Semantic Digital Libraries
Fields: Knowledge Extraction, Digital Libraries, Ontology Design.
Research Focus: This research line is an application of the fundamental theory and practice of human-centered information systems as developed in the fundamental research line. It handles (unstructured) text documents and their related (structured) meta-data in the context of digital libraries. In this scenario, the relevant meta-data which would be required to perform human-centric queries is unavailable. Therefore, that missing information again needs to be extracted from both external sources like user judgements, but also from the actual textual document itself. Thus, the focus of this research line is on the domain-specific knowledge extraction and linking techniques required to realize the vision of human-centered information systems for digital libraries.
Domain-Specific Pitch: Academic publications are a central repository of human-knowledge, and are at the core of scientific advancements both in academia itself, but also of industrial progress. However, tapping into this vast repository of knowledge is a daunting and challenging task, as the number of available publications is growing with tremendous speed. Without proper support, it is often hard or even impossible to find relevant publications related to a given problem in a timely fashion. Providing this support has always been the domain of libraries. However, the near exponential growth of con-tent in the recent years together with the shift to digital resources invalidated many well-proven workflows, demanding new solutions suitable for the current age and time. Efforts to make highly specialized academic knowledge accessible need to go beyond simple bibliographic metadata, the current state of the art. Instead, most information search of human users is inherently entity-centric, being in the most central aspect of publications people perceive as relevant. Some domains like medicine and chemistry have realized this trend early on, and invested heavily into annotating scientific publications with their most relevant entities to support more meaningful search and exploration like genes and proteins, chemical structures and molecules, or drug names. However, the efforts are very costly as they still rely heavily on manual curation and semi-manual workflows, and are thus pro-hibitive for many domains which lack the resources for such measures. Thus, in many domains the query capabilities and meta-data availability is insufficient to cope with the user’s information demand. Therefore, in this line of work, I propose to design, develop, and evaluate novel techniques for extracting entity, centric-meta from research publications for human-centered queries in a mostly automatic fashion, and showcase the effectiveness of our approaches in domains which currently lack support of rich semantic academic metadata. Beyond the obvious contributions like providing entity-centric search, offering facetted browsing capabilities, and realizing semantically meaningful recommendation and exploration of content, I can also use the extracted metadata for contributions to the digital library domain itself by tracking trends or the change of topics in a visual-ly appealing and comprehensive fashion.
Outlined contributions are as follows:
- Extend the current state-of-the-art of systems re-search in the digital library domain by covering challenges like analyzing and annotation educational content, sequencing educational content into micro-learning objects, and developing both recommenda-tion and query capabilities
- Developing a demonstrator prototype system which can augment current digital libraries with additional human-centric meta-data
- Developing human-centered query capabilities utilizing that meta-data for innovative new query paradigms, as for example visual exploration or facetted navigation
Example of Domain-Specific Content-Related Metadata
Example of possible analysis techniques: Visualization of Corpus, Trend-tracking
- Semantic Open Courseware
Fields: Knowledge Extraction, Online Education, Recommendation.
Research Focus: This line of research pushes for human-centered information systems in the educational domain, i.e. instead of simply offering learning materials with limited meta-data, content is analyzed with respect to which parts human users consider relevant for a certain information need. Thus, this is another application of my fundamental theory to domain-specific challenges. From a technology point of view, this shares many similarities with the previous of line of work of entity-centric extraction in digital libraries, as many methods, insights, and results can be shared across both research lines. Ultimately, I envision that both research lines can even be unified into a shared prototype implementation which can also bridge and integrate scientific knowledge into university education tailored to individual users/students.
Domain-Specific Pitch: Online education has seen a tremendous growth during the recent years, covering e-learning offers ranging from traditional online courses to Massive Open Online Courses (MOOCs) and private online courses (SPOCs). Additionally, nearly all higher education institutions support their on-site courses by providing the necessary materials like slides or other course material digitally. At the heart of this development are courseware infrastructures, platforms handling the communication between learners and teachers, and storing and distributing all relevant learning objects. Those learning objects, usually created, curated, and tailored with great care and costs by educational or domain experts represent a significant investment. Therefore, it has been a long-term challenge to provide these learning objects as open education resources to a wide public to maximize their impact with the goal to support a wide variance of target audiences like traditional learners in university courses or online courses, professionals which need to obtain focused competencies, but also educators at higher educational institutions to motivate the reuse of high-quality material to free up valuable personal resources. However, while nowadays there a multitude of platforms offering whole courses with varying degree of openness, this vision of easy access to fine-grained open educational resources has still not been fully realized. One reason for this is that current courseware platforms lack semantic and analytic capabilities to support the sophisticated query, search, and recommendation requirements necessary to efficiently serve the specific information needs of individual learners or educators – courseware platforms are mostly used as repositories for storing and statically serving learning resources alongside manually created meta-data with respect to fixed learning paths as provided by the course designer. The goal of this project is to complement current courseware platforms with state-of-the-art semantic analysis capabilities to obtain deep understanding of both the users and the resources stored in a courseware platform to provide personalized access tailored to the individual information need of users. At the core of my suggested solution are micro-learning objects, i.e. the smallest units of thematic cohesion found in learning content. In this project, I aim at identifying, extracting, and semantically annotation such micro-learning objects in an automated fashion. This annotation will cover multiple facets perceived relevant by both learners and educations like topics, didactic intend, required expertise, or perceived attributes (i.e. based on user judgements).
- Previous Projects
(2012-2015) in MovieExplore - Discovering Movies in a Human-Centric Fashion
This project focuses on discovering the perceptual properties of movies (and also other experience items) for enabeling human-centric interaction paradigms. This will allow users to interact with complex product spaces in a more natural and easier to understand fashion.
This project is funded by the presidential office of TU Braunschweig in order to foster and encourage innovation in university didactics. In this project, we develop novel concepts for teaching relational query languages like SQL, unifying approaches from modern didactics with mobile gamification. The goal of this project is to develop and establish a “serious” online game for supporting the B.Sc. lecture “Relational Databases 1”
see http://www.ifis.cs.tu-bs.de/content/sqlalchemist (in German)
(2013-2015) Anaqonda Project - Analogy Queries by Ontology-based Data Analytics
The Anaqonda project deals with intelligent queries und personalization, and focuses especially on analogy queries. This project was funded by DAAD and executed at NII Tokyo.
For more information, see http://www.ifis.cs.tu-bs.de/content/anaqonda
(2006-2010) APIS - Advanced Personalization in Information Services
The APIS project (Advanced Personalization in Information Services) investigates the impact of personalization technology on future information provisioning. Being interdisciplinary between computer science and cognitive sciences a basic belief of the APIS group is that modern information provisioning needs advanced query processing and optimization techniques using and understanding human preferences, usage patterns, conceptual views and (domain) ontologies. At the same time the architectures for information provisioning have to move from monolithical database systems to more open service-oriented infrastructures.
Source Code for our Skyline Simulator & Datasets can be found here.
For more information, see http://www.ifis.cs.tu-bs.de/content/apis
(2006 - 2008) Prolix - Process Oriented Learning and Information Exchange