
Fumihito NishinoFujitsu Ltd. · Fujitsu Laboratories
Fumihito Nishino
About
17
Publications
1,326
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
55
Citations
Publications
Publications (17)
Scientific publication management services are changing drastically. On the one hand, researchers demand intelligent search services to discover scientific publications. On the other hand, publishers need to incorporate semantic information to better organize their digital assets and make publications more discoverable. For this purpose, we investi...
Fujitsu Laboratories is conducting R&D on technology to integrate and utilize data by using Linked Data, which is a standard methodology for publishing data on the Web that is being promoted by the World Wide Web Consortium (W3C). W3C is the main international standards organization for Web technologies. Linked Data uses a machine-readable structur...
The fact that a company always owns various names, such as Chinese full names, Chinese abbreviative names and English abbreviative
names, makes it very difficult to collect and extract relative information about the company, because: (1) It is hard to identify
a company’s Chinese abbreviative names. (2) It is hard to discover relationships between...
Using abundant Web resources to mine Chinese term translations can be applied in many fields such as reading/writing as- sistant, machine translation and cross- language information retrieval. In mining English translations of Chinese terms, how to obtain effective Web pages and evaluate translation candidates are two challenging issues. In this pa...
This paper proposes a lexicon-constrained character model that com- bines both word and character features to solve complicated issues in Chinese morphological analysis. A Chinese character-based model constrained by a lexicon is built to acquire word building rules. Each character in a Chinese sen- tence is assigned a tag by the proposed model. Th...
BBS is an electrical forum on Web where people discuss many topics. So it’s a challenging problem to retrieve hot topics from
it. There are various features of hot topics. Though count of posts on BBS about topic is a simple and effective feature for
hotness of topic, it is shown in the paper that a better result can be obtained if irrelevant posts...
The new word finding is a difficult and indispensable task in Chinese segmentation. The traditional methods used the string
statistical information to identify the new words in the large-scale corpus. But it is neither convenient nor powerful enough
to describe the words’ internal and external structure laws. And it is even the less effective when...
Images play a very important role in web content delivery. Many WWW images contain text information that can be used for web indexing and searching. A new text extraction and recognition algorithm is proposed in this paper. The character strokes in the image are first extracted by color clustering and connected component analysis. A novel stroke ve...
The mformaton to mclu'e Jn a summary varies depending on the author's intention and the use of the summary To create the best summaries, the appropriate goals of the extracting process should be set and guide should be outlined that instructs the system how to meet the tasks The approach described in ths report intended to be a basic archttecture t...
Mining terminology translation from a large amount of Web data can be applied in many fields such as reading/writing assistant,
machine translation and cross-language information retrieval. How to find more comprehensive results from the Web and obtain
the boundary of candidate translations, and how to remove irrelevant noises and rank the remained...
In this paper, we present a tool, Web Orchestration, which allows people to customize and share the web information in a simple
way. Our work is based on the web annotation and web scraping technique. It adopts B/S architecture, and has a user-friendly
interface. It can be used in many aspects, such as web information monitoring, web information sh...
Protein name recognition is a fundamental precursor to information extraction of protein-protein interactions from MEDLINE abstracts. In this paper, we explore how to adapt maximum entropy approach to protein name recognition. We also present a novel method which uses the determinis- tic finite automata to deal with the tag sequence gotten from the...
Named entity recognition is a very important part of information retrieval and information extraction. Classification is also very important. This paper investigates the sub-classification of named entities from the point of view of information retrieval and information extraction. This paper also presents multi-classification and gives detailed in...
寛治 内野 文人 西野 Kanji Uchino- [...]
フミヒト ニシノ
http://www.tulips.tsukuba.ac.jp/mylimedio/dl/page.do?issueid=695223&tocid=100086006&page=26-31
Projects
Project (1)