Hideaki Takeda

Hideaki Takeda
National Institute of Informatics

About

358
Publications
30,267
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
3,911
Citations
Citations since 2016
55 Research Items
1586 Citations
2016201720182019202020212022050100150200250300
2016201720182019202020212022050100150200250300
2016201720182019202020212022050100150200250300
2016201720182019202020212022050100150200250300
Additional affiliations
May 2003 - present
The Graduate University for Advanced Studies
Position
  • Professor (Full)
April 2000 - present
National Institute of Informatics
Position
  • Professor
April 1993 - March 2000
Nara Institute of Science and Technology
Position
  • Professor (Associate)

Publications

Publications (358)
Conference Paper
We are proposing Digital Triplet (D3) to support manufacturing system engineers in executing engineering processes on shop floors, including Kaizen activities, by fully utilizing Digital Twin. The goal of our research is to verify the feasibility and advantages of D3. For this purpose, this paper aims to develop a learning factory to embody D3. Fir...
Article
Information sharing can be effective with structured data. The Semantic Web is mainly aimed at creating standard ontologies. However, users have different preferences and evolving requirements. It is not practical to attempt perfect schema definitions with strict constraints. Creating structured formats should be a collaborative and evolutionary pr...
Conference Paper
Full-text available
This research-in-progress paper presents a novel visualization approach to facilitate understanding of interdisciplinary collaboration within a university using a large-scale dataset of research grant projects in Japanese universities. First, we construct a network of departments in a target university, and then visualize the activeness of interdep...
Chapter
Automated predictions affect many areas of modern life, including risk scores in health care and insurance, potential mates in online dating apps, and recommendations in film and music streaming services.
Conference Paper
We are proposing the concept of ‘Digital Triplet’ (D3) to support manufacturing system engineers in executing engineering processes, including Kaizen, by utilizing Digital Twin. The goal of our research is to verify the feasibility and advantage of D3 by embodying D3. For this purpose, we develop a learning factory based on the D3 concept. First, t...
Article
The objective of the study is to propose a systematic method for acquiring experts’ knowledge that is to be integrated with a digital twin (DT) to construct a production system that is robust against unpredicted changes in a production environment. Especially, the paper focuses on the knowledge acquisition for the high-mix and low-volume (HMLV) pro...
Preprint
Full-text available
In the Open Data era, a large number of table resources have been made available on the Web and data portals. However, it is difficult to directly utilize such data due to the ambiguity of entities, name variations, heterogeneous schema, missing, or incomplete metadata. To address these issues, we propose a novel approach, namely TabEAno, to semant...
Preprint
Full-text available
Entity representations are useful in natural language tasks involving entities. In this paper, we propose new pretrained contextualized representations of words and entities based on the bidirectional transformer. The proposed model treats words and entities in a given text as independent tokens, and outputs contextualized representations of them....
Conference Paper
Full-text available
The embeddings of entities in a large knowledge base (e.g., Wikipedia) are highly beneficial for solving various natural language tasks that involve real world knowledge. In this paper, we present Wikipedia2Vec, a Python-based open-source tool for learning the embed-dings of words and entities from Wikipedia. The proposed tool enables users to lear...
Article
We introduce the features of Common Agriculture Vocabulary (CAVOC) and CAVOC-based services, review other resources for standards in the domains of agriculture and food, and briefly discuss the future direction of agricultural semantics standards in Japan based on reviews. CAVOC has been developed as a core vocabulary for enhancing semantics-level...
Chapter
This paper describes an ontology-based development of activity knowledge on a domain and the system we developed to support it. To understand human activities, it is important to explicitly describe the knowledge of each domain. However, there are some issues of knowledge development: the establishment of the efficient method and process, the impro...
Preprint
Full-text available
The embeddings of entities in a large knowledge base (e.g., Wikipedia) are highly beneficial for solving various natural language tasks that involve real world knowledge. In this paper, we present Wikipedia2Vec, a Python-based open-source tool for learning the embeddings of words and entities from Wikipedia. The proposed tool enables users to learn...
Article
This paper describes the practice of knowledge construction using a domain ontology and procedural knowledge in guitar performance. Domain ontologies are created to understand and reason the domain knowledge. However, most of them are difficult to understand ontological logic and languages for domain experts. It is important to consider the differe...
Article
Full-text available
In recent years, there has been an increasing interest in numerical semantic labeling, in which the meaning of an unknown numerical column is assigned by the label of the most relevant columns in predefined knowledge bases. Previous methods used the p value of a statistical hypothesis test to estimate the relevance and thus strongly depend on the d...
Preprint
Full-text available
This paper presents the design of our system, namely MTab, for Semantic Web Challenge on Tabular Data to Knowledge Graph Matching (SemTab 2019). MTab combines the voting algorithm and the probability models to solve critical problems of the matching tasks. Results on SemTab 2019 show that MTab obtains promising performance for the three matching ta...
Article
Full-text available
Suggestion classification for opinion data is defined as identifying a given utterance by suggestion or non-suggestion class. In this paper, we introduce a method called LLMaxent which is the solution for the cross-domain suggestion classification. LLMaxent is a lifelong machine learning approach using maximum entropy (Maxent). In the course of lif...
Preprint
Full-text available
Semantic labeling is a task of matching unknown data source to labeled data sources. The semantic labels could be properties, classes in knowledge bases or labeled data are manually annotated by domain experts. In this paper, we presentEmbNum, a novel approach to match numerical columns from different table data sources. We use a representation net...
Article
AI-based medical and healthcare devices and systems have unique characteristics including 1) plasticity causing changes in system performance through learning, and need of creating new concepts about the timing of learning and assignment of responsibilities for risk management; 2) unpredictability of system behavior in response to unknown inputs du...
Conference Paper
Linked Open Data (LOD) technology enables web of data and exchangeable knowledge graphs through the Internet. However, the change in knowledge is happened everywhere and every time, and it becomes a challenging issue of linking data precisely because the misinterpretation and misunderstanding of some terms and concepts may be dissimilar under diffe...
Chapter
Based on extracting information from Wikipedia, DBpedia is a large scale knowledge base and makes this one available using Semantic Web and Linked Data principles. Thanks to crowd-sourcing, it currently covers multiples domains in multilingualism. Knowledge is obtained from different Wikipedia editions by effort of contributors around the world. Th...
Article
Full-text available
We frequently encounter unfamiliar entity names (e.g., a persons name or a geographic location) while reading texts such as newspapers, magazines, and Web pages. When this occurs, we typically perform a sequence of tedious actions: select the entity name, submit it to a search engine, and obtain detailed information from Web sites. In this paper, w...
Conference Paper
Full-text available
Modeling, analysis, control, and management of complex social networks represent an important area of interdisciplinary research in an advanced digitalized world. In the last decade social networks have produced significant online applications which are running on top of a modern Internet infrastructure and have been identified as major driver of t...
Conference Paper
Full-text available
We can see thanks to the advances in speech processing and recognition technology, voice-based human-machine interaction system such as personal virtual assistants, chat-bots, and contact centers are becoming increasingly popular. Because of this trend, conversation mining are getting attention of many researchers. In conversation mining, standardi...
Conference Paper
Full-text available
Fully understanding suggestion intents in conversational texts is a complicated process that includes three major stages: user suggestion intents filtering, suggestion domain identification, and arguments extraction of suggestion intents. In the scope of this paper, we study the first phase, that is, building a binary classification model to determ...
Article
Full-text available
We describe a neural network model that jointly learns distributed representations of texts and knowledge base (KB) entities. Given a text in the KB, we train our proposed model to predict entities that are relevant to the text. Our model is designed to be generic with the ability to address various NLP tasks with ease. We train the model using a l...
Book
This book constitutes the combined refereed proceedings of ISWC Satellite Wor shops KEKI and NLP&DBpedia 2016 which were held in conjunction with ISWC 2016 in Kobe, Japan, in October 2016. The 9 papers presented were carefully selected and reviewed from 20 submissions. They focus on the use of linguistic linked open data, the linguistic aspects of...
Conference Paper
The purpose of this paper is to present the higher order formalization of RDF and OWL with setting up ontological meta-modeling criteria through the discussion of Russell’s Ramified Type Theory, which was developed in order to solve Russell Paradox appeared at the last stage in the history of set theory. This paper briefly summarize some of set the...
Conference Paper
This paper proposes Agriculture Activity Ontology (AAO) as a basis of the core vocabulary of agricultural activity. Since concepts of agriculture activities are formed by the various context such as purpose, means, crop, and field, we organize the agriculture activity ontology as a hierarchy of concepts discriminated by various properties such as p...
Article
Most existing methods on research collaborator recommendation focus on promoting collaboration within a specific discipline and exploit a network structure derived from co-authorship or co-citation information. To find collaboration opportunities outside researchers' own fields of expertise and beyond their social network, we present an interdiscip...
Article
Full-text available
Taxonomic knowledge provides a scientific name to each organismal group and is thus indispensable information for understanding biodiversity. However, the various perspectives of classifying organisms and changes in taxonomic knowledge have led to inconsistent classification information among different databases and repositories. To have a precise...
Article
We describe a procedure for constructing a website for publishing open data by focusing on the case of Open DATA METI, a website of the Ministry of Economy, Trade, and Industry of Japan. We developed two sites for publishing open data: a data catalog site and one for searching linked open data (LOD). The former allows users to find relevant data th...
Article
Social networking services like Twitter have been playing an import role in people's daily life since it supports new ways of communicating effectively and sharing information. The advantages of these social network services enable them rapidly growing. However, the rise of social network services is leading to the increase of unwanted, disruptive...
Article
Full-text available
It is crucial to promote interdisciplinary research and recommend collaborators from different research fields via academic database analysis. This paper addresses a problem to characterize researchers' interests with a set of diverse research topics found in a large-scale academic database. Specifically, we first use latent Dirichlet allocati...
Article
Full-text available
Many online social networks have been studied in the last decade, giving us insights into the way people diffuse information, communicate, and organize themselves. In this article, we focus on the emergent organization in massive artistic cooperation. We study the creation process of complex music videos in a platform called Nico Nico Douga. We giv...
Conference Paper
It is known that Semantic Web and Linked Open Data (LOD) are powerful technologies for knowledge management, and explicit knowledge is expected to be presented by RDF format (Resource Description Framework), but normal users are far from RDF due to technical skills required. As we learn, a concept-map or a node-link diagram can enhance the learning...
Chapter
Many researchers identify influentials in a network by their betweenness centrality. Whereas betweenness centrality can be calculated in small, static, connected networks, its calculation in complex, large, evolving networks frequently causes some problems. Hence, we propose a proxy variable for a node’s betweenness centrality that can be calculate...
Conference Paper
Full-text available
Named Entity Disambiguation (NED) refers to the task of resolving multiple named entity mentions in a document to their correct references in a knowledge base (KB) (e.g., Wikipedia). In this paper, we propose a novel embedding method specifically designed for NED. The proposed method jointly maps words and entities into the same continuous vector s...
Article
We describe a procedure for constructing a website for publishing open data by focusing on the case of Open DATA METI, a website of the Ministry of Economy, Trade, and Industry. We developed two sites for publishing open data: a data catalog site and one for searching linked open data (LOD). The former allows users to find relevant data they want t...
Article
Open science, promoting the distribution of research information, and identifiers have been becoming big topics lately. To make research data re-used and open across the research domains, adding metadata to express its contents and assigning an identifier is necessary. The Japan Link Center, DOI registration agency has performed a pilot project to...
Conference Paper
Hashtags increase the reachability of a tweet to manifolds and consequently, has the potential to create a wider market for brands. The frequent use of a hashtag features it in the Twitter trending list. In this study we want to understand what contributes to the popularity of a hashtag. Further, hashtags generally come in groups in a tweet. In fac...
Article
Full-text available
Many aspects of online social networks (OSN) have been studied in recent years. In this article, we focus on the question of interactions in large OSN. We propose methods to study these interactions, and apply them on a platform called Nico Nico Douga (NND), with the aim of understanding cooperative behaviors, taking the form of collective creation...
Article
By decoupling the network function from the underlying dedicated hardware, enterprises can reap the benefit of outsourcing network functions to the cloud. Due to a lack of analytical tools, one cannot simply justify the worthiness of outsourcing network functions. This paper proposes a decision support framework for network function outsourcing, ca...
Article
Full-text available
Community detection is one of the most studied problems on complex networks. Although hundreds of methods have been proposed so far, there is still no universally accepted formal definition of what is a good community. As a consequence, the problem of the evaluation and the comparison of the quality of the solutions produced by these algorithms is...
Article
Full-text available
We present a novel approach for detecting, classifying, and linking entities from Twitter posts (tweets). The task is challenging because of the noisy, short, and informal nature of tweets. Consequently, the proposed approach introduces several methods that robustly facilitate successful realization of the task with enhanced performance in several...
Article
In this article, we discuss how Linked Data is suitable to create cross-disciplinary database with respect to the mechanism and the existing datasets. Linked Data mechanism is to provide “web of data” by describing data with RDF while the traditional Web pages consist of “web of documents” with HTML. In particular, DBpedia, LOD of Wikipedia, has ac...
Conference Paper
Full-text available
Linked Open Data for ACademia (LODAC) together with National Museum of Nature and Science have started collecting linked data of interspecies interaction and making link prediction for future observations. The initial data is very sparse and disconnected, making it very difficult to predict potential missing links using only one prediction model al...
Article
Full-text available
When we encounter an interesting entity (e.g., a person's name or a geographic location) while reading text, we typically search and retrieve relevant information about it. Entity linking (EL) is the task of linking entities in a text to the corresponding entries in a knowledge base, such as Wikipedia. Recently, EL has received considerable attenti...
Article
Full-text available
We present a new type of visualization designed to help the understanding of inner mechanisms of mass cooperation. This type of cooperation is ubiquitous nowadays, not only in Online Social Networks, but also in many other situations, such as scientific research on a worldwide scale. Mass cooperation is also at the source of most complex systems. O...
Article
Researchers of agriculture, life science and drug design of the need to acquire information that combines two or more life science databases for problem solving. Semantic Web technologies are already necessary for data integration between those databases. This study introduces a technique of utilizing RDF (Resource Description Framework) and OWL (W...
Article
Full-text available
We built a researcher identifier management system called the Researcher Name Resolver (RNR) to assist with the name disambiguation of authors in digital libraries on the Web. RNR, which is designed to cover all researchers in Japan, is a Web-oriented service that can be openly connected with external scholastic systems. We expect it to be widely u...
Conference Paper
Full-text available
At present, a comprehensive set of measurement, modeling, analysis, simulation, and performance evaluation techniques are employed to investigate complex networks. A direct transfer of the developed engineering methodologies to related analysis and design tasks in next-generation energy networks, energy-efficient systems and social networks is enab...
Article
This research has explored the creation process of new business models and the role of organisation in that process, in order to enhance the knowledge about how to create a new business model effectively. The multiple case-study method was applied to this research. The samples are four new business models which have been developed at a SOGO-SHOSYA,...
Article
In Twitter information primarily propagates through retweet mechanism. While a massive amount of tweets gets generated everyday, only a handful of them get retweeted widely. In this study, we have investigated the impact of user-roles in retweet phenomena. We have introduced the concept of "Information Diffusion Impact" (IDI) and identified three i...
Article
In this paper, we present a platform designed to explore visually massive cooperation between individuals. With the increasing importance of the Internet, new types of cooperation are becoming common, in which hundreds, thousands or millions of individuals act together in interaction, and produces content in a decentralized manner. As these pro- ce...
Conference Paper
This paper presents the study of pattern of interaction in research work (co-authorship in publications and co-partnership in research projects) using the techniques of social network analysis (SNA) and visualization. Along with computation of various network features at local and global levels to find the nature of collaboration, and important pos...
Conference Paper
The structure of scientific co-authorship networks can reveal patterns of scientific collaboration. Our study focused on developing a methodology for community finding in co-authorship networks that could reveal the real, i.e., physical, patterns of scientific collaboration, by using two datasets with different collaboration scopes: one considering...
Conference Paper
Although a large number of datasets gathered as Linked Open Data (LOD) is better for data sharing and re-using, the datasets themselves become more difficult to understand. Since each dataset has its own data structure, we need to understand datasets individually. In addition, since the entities in datasets are interconnected, we need to understand...
Conference Paper
Because of a huge variety of biological studies focused on different targets, i.e., from molecules to ecosystem, data produced and used in each field is also managed independently so that it is difficult to know the relationship among them. We aim to build a data hub with LOD to connect data in different biological fields to enhance search and use...
Article
Dynamics of information propagation in Twitter has been studied in the context of retweet practices. In order to build the network of information flow, either the activity network or the follower network has been considered. By combining both and introducing "Information Diffusion Impact" (IDI) on network we characterize important user roles in inf...
Article
The variety of classification systems and the new discovery of taxonomists lead to the diversity of biological information, especially taxon concepts. The association among taxon concepts across research institutes is very difficult to establish, because there is no single interpretation of the name of a taxon concept. Owing to this difficulty, fur...