Hiroyuki Sakai

(酒井浩之)

Profile Information

Affiliation: Professor, Faculty of Science and Technology Department of Science and Technology , Seikei University

Degree: Master(Engineering)(Toyohashi University of Technology)

J-GLOBAL ID: 200901074063121489
researchmap Member ID: 5000031733

External link: http://www.ci.seikei.ac.jp/sakai/

Research Interests

Research Areas

Informatics / Intelligent informatics / 自然言語処理

Research History

Apr, 2014 - Present

Faculty of Science and Technology, Department of Computer and Information Science, Seikei University
Apr, 2012 - Mar, 2014

Faculty of Science and Technology, Department of Computer and Information Science, Seikei University
Apr, 2010 - Mar, 2012

Toyohashi University of Technology
Apr, 2007 - Mar, 2010

Toyohashi University of Technology
Jun, 2005 - Mar, 2007

Toyohashi University of Technology

Education

Apr, 2002 - Mar, 2005

Toyohashi University of Technology
Apr, 2000 - Mar, 2002

Toyohashi University of Technology
- 2002

Department of Knowledge-based Information Engineering, Graduate School, Division of Engineering, Toyohashi University of Technology
Apr, 1998 - Mar, 2000

Faculty of Engineering, Toyohashi University of Technology
- 2000

Department of Knowledge-based Information Engineering, Faculty of Engineering, Toyohashi University of Technology

Papers

Extraction of Important Pages of Shareholder Convocation Notices Using Deep Learning by Automatic Generation of Training Data

Kaito Takano, Hiroyuki Sakai, Kei Nakagawa

Transactions of the Japanese Society for Artificial Intelligence, 36(1) WI2-G_1, Jan 1, 2021 Peer-reviewed
Extraction of causal and complementary information for generating market analysis comments by automatic generation of training data

SAKAI Hiroyuki, SAKAJI Hiroki, IZUMI Kiyoshi, MATSUI Tohgoroh, IRIE Keitaro

Proceedings of the Annual Conference of JSAI, 2020 1D3GS1303-1D3GS1303, 2020

<p>In this research, we propose a method for extracting sentences containing causal information from articles describing the market conditions of the Nikkei Stock Average. The sentences containing causal information are needed to generate market analysis comments. Our method extracts articles describing the market conditions of the Nikkei Stock Average from economic newspaper articles and extracting sentences containing causal information from the extracted articles by deep learning. Here, our method automatically generates the training data necessary to extract the articles describing the market conditions and sentences containing causal information by deep learning and achieved high accuracy. Moreover, our method extracts complementary information of the content described in the causal sentences by using economic causal-chain search.</p>
有価証券報告書からの事業セグメント付与された業績要因文・業績結果文の抽出

高野海斗, 酒井浩之, 北島良三

人工知能学会論文誌, 34(5) 1-22, Sep, 2019 Peer-reviewed
Automatic generation of market analysis comments by using relevant articles

SAKAI Hiroyuki, SAKAJI Hiroki, IZUMI Kiyoshi, MATSUI Tohgoroh, IRIE Keitaro

JSAI Technical Report, Type 2 SIG, 2019(FIN-022) 61, Mar 3, 2019
Extraction of causal information from summaries of financial statements by automatic generation of training data

31(2) 653-661, 2019 Peer-reviewed
潜在学習による企業トップメッセージと収益性の関係解析

北島良三, 酒井浩之, 上村龍太郎

日本知能情報ファジィ学会誌, 2019 Peer-reviewed
Extraction of Relevant Companies from Multiple Companies and Classification Based on Business

2019 Peer-reviewed
Enriching folksonomy for online videos

Hiroki Sakaji, Masaki Kohana, Akio Kobayashi, Hiroyuki Sakai

International Journal of Grid and Utility Computing, 10(3) 258-264, 2019 Peer-reviewed
Automatic generation of market analysis comments from financial articles

SAKAI Hiroyuki, SAKAJI Hiroki, IZUMI Kiyoshi, MATSUI Tohgoroh, IRIE Keitaro

JSAI Technical Report, Type 2 SIG, 2018(FIN-020) 44, Mar 20, 2018
Extraction of sentences concerning business performance forecast and economic forecast from summaries of financial statements by deep learning

Shiori Kitamori, Hiroyuki Sakai, Hiroki Sakaji

2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017 - Proceedings, 2018- 1-7, Feb 2, 2018 Peer-reviewed

The proportion of individual investors in security markets has been rising at a rapid rate in Japan in recent times. As a result, the need for technology to support individual investors in making smarter investment decisions is ever increasing. When individual investors make investment decisions, what is the most important information they should hunt for? It is to be able to sense the company's future performance forecast. This is because even if the current performance is in the red (unfavorable) zone, stock prices will tend to rise if it shows that the future performance will recover from the enterprise side. Therefore, in this research, we focus on the financial results as an information source including performance forecasts. The summaries of financial statements include sentences concerning business performance forecast of companies and other associated sentences predicting the upcoming economic conditions. In this study, we extract and classify sentences indicating business performance forecasts and economic forecasts from summaries of financial statements. This allows individual investors to easily grasp the company's performance forecast. Our approach consists of two steps. First, our method extracts sentences relating to 'prediction' by using clue expressions. Second, we automatically classify these prediction oriented sentences into two classes by using deep learning. Our method is capable of automatically generating training data for deep learning and greatly omits efforts to create the training data set by hand.
Discovery of rare causal knowledge from financial statement summaries

Hiroki Sakaji, Risa Murono, Hiroyuki Sakai, Jason Bennett, Kiyoshi Izumi

2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017 - Proceedings, 2018- 1-7, Feb 2, 2018 Peer-reviewed

What would happen if temperatures were subdued and result in a cool summer? One can easily imagine that air conditioner, ice cream or beer sales would be suppressed as a result of this. Less obvious is that agricultural shipments might be delayed, or that sound proofing material sales might decrease. The ability to extract such causal knowledge is important, but it is also important to distinguish between cause-effect pairs that are known and those that are likely to be unknown, or rare. Therefore, in this paper, we propose a method for extracting rare causal knowledge from Japanese financial statement summaries produced by companies. Our method consists of three steps. First, it extracts sentences that include causal knowledge from the summaries using a machine learning method based on an extended language ontology. Second, it obtains causal knowledge from the extracted sentences using syntactic patterns. Finally, it extracts the rarest causal knowledge from the knowledge it has obtained.
Extracting Laboratory Front Pages from University Websites

Hiroki Sakaji, Atsuya Miyazaki, Hiroyuki Sakai, Kiyoshi Izumi

ADVANCES IN NETWORK-BASED INFORMATION SYSTEMS, NBIS-2017, 7 1117-1125, 2018

In this paper, we propose a method for extracting laboratory front pages from university websites. There are more than 779 universities and colleges in Japan. For selecting a university or a college, some high school students want to know what laboratories these universities or colleges have. To learn about these laboratories, high school students have to search the laboratory front pages from the university websites. However, sometimes it is difficult to find a laboratory front page because they are sometimes buried deep in the hierarchy of university websites. Our method extracts laboratory front pages by using a support vector machine model and applying certain rules. We also developed a laboratory search system that can be used to retrieve laboratory front pages extracted with our method. We evaluated our method and confirmed that is attained 85.0% precision and 65.5% recall.
A system for classifying proposals and estimating start pages stated in notice of annual meeting of shareholders

Kaito Takano, Hiroyuki Sakai, Hiroki Sakaji, Kiyoshi Izumi

Journal of Natural Language Processing, 25(1) 3-32, 2018 Peer-reviewed

<p>In this paper, we describe research on applied systems for realizing efficiency of work to store information of notice of annual meeting of shareholders in the database by using text mining technology. We aim to estimate start pages of proposals stated in notice of the meeting of shareholders and classify which proposal the page is. And we developed a system that automatically performs these tasks using text information of the notice of convocation of shareholders, and actually operates it. As a result of comparative experiment between our implemented system and conventional manual work, the working time was shortened to about 1/10. We propose three methods for classifying proposals. The first method classifies proposals by specialized terms extracted from training data. The second method classifies proposals by using deep learning. The final method classifies proposals by extracted proposal title. We evaluated our methods, and the effectiveness of each method was verified. </p>
Financial Informatics : AI Application in Finance

Journal of the Japanese Society for Artificial Intelligence, (32) 905-910, Nov 1, 2017
Estimation of Starting Pages for Each Agenda Item Stated in Notice of Annual Meeting of Shareholders

TAKANO Kaito, SAKAI Hiroyuki, SAKAJI Hiroki, IZUMI Kiyoshi, OKADA Nana, MIZUUCHI Toshikazu

JSAI Technical Report, Type 2 SIG, 2017(FIN-018) 10, Mar 10, 2017

In this research, we aim to predict start pages of proposals stated in notice of the meeting of shareholders and classify which proposal the page is. We propose two methods that classification method of proposals. The first method heuristically predicts the page on which the proposal is described. Moreover our method extracts specialized terms of each proposal and assigns weights to them. After that, our method classifies proposals by specialized terms. The second method classifies proposals using deep learning. Each methods were evaluated, and the effectiveness of each methods was verified.
Estimation of Tags via Comments on Nico Nico Douga

Hiroki Sakaji, Masaki Kohana, Akio Kobayashi, Hiroyuki Sakai

PROCEEDINGS OF 2016 19TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS), 550-553, 2016 Peer-reviewed

In this paper, we propose a method that estimates new tags from user comments on videos on the Nico Nico Douga website. On Nico Nico Douga, users can post tags and comments on videos. However, users cannot post more than 12 tags on a video; therefore, there are some important tags that could be posted but are sometimes missed. We present a technique to acquire some of these missing tags by choosing new tags that score well in a scoring method developed by us.
決算短信PDFからの業績予測文の抽出

北森詩織, 酒井浩之, 坂地泰紀

電子情報通信学会論文誌, J100-D(2) 150-161, 2016 Peer-reviewed
Extraction of Polarity Comments from Nico Nico Douga

Hiroki Sakaji, Junya Ishibuchi, Hiroyuki Sakai

International Journal of Space-Based and Situated Computing, 6(3) 165-172, 2016 Peer-reviewed
決算短信PDFからの原因・結果表現の抽出

坂地泰紀, 酒井浩之, 増山繁

電子情報通信学会論文誌D, J98-D(5) 811-822, May, 2015 Peer-reviewed
Extraction of causal information from PDF files of the summary of financial statements of companies

Hiroyuki Sakai, Hiroko Nishizawa, Shogo Matsunami, Hiroki Sakaji

Transactions of the Japanese Society for Artificial Intelligence, 30(1) 172-182, Jan 6, 2015 Peer-reviewed

In this paper, we propose a method of extracting causal information from PDF files of the summary of financial statements of companies, e.g., ”The sales of smart phones was expanded continually”. Cause information is useful for investors in selecting companies to invest. We downloaded 106,885 PDF files of the summary of financial statements of companies fromWeb pages of the companies automatically. Our method extracts causal information from the PDF files by using clue expressions (e.g., ”was expanded”) and keywords relevant to a company. The clue expressions are extracted from the PDF files of the summary of financial statements of companies and articles concerning business performance of companies automatically. We developed the search system which is able to retrieve causal informations extracted by our method. The search system shows causal information containing a keyword inputted by users, and the summary of financial statements containing the retrieved causal information. We evaluated our method and it attained 83.91% precision and 55.04% recall, respectively. Moreover, we compared our method with Sakai et al’s method originally proposed for extracting causal information from financial articles concerning business performance of companies and experimental results showed that our method outperforms Sakai et al’s method.
Extracting Polarity Comments from Nico Nico Douga

Hiroki Sakaji, Junya Ishibuchi, Hiroyuki Sakai

PROCEEDINGS 2015 18TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS 2015), 669-672, 2015 Peer-reviewed

In this paper, we propose a method that extracts positive comments (e.g. (motto hyouka sarerubeki: should be valued)") and negative comments (e.g. (hetakuso: dub)") from Nico Nico Douga automatically. For example, positive comments extracted by our method are beneficial for analysis of campaign broadcast.
企業の決算短信PDFからの業績要因の抽出

酒井浩之, 西沢裕子, 松並祥吾, 坂地泰紀

人工知能学会論文誌, 30(1) 172-182, Jan, 2015 Peer-reviewed
Important Causal Information Extraction from Japanese Financial Articles Concerning Business Performance of Companies

SAKAI Hiroyuki, MASUYAMA Shigeru

The IEICE transactions on information and systems (Japanese edition), 96(11) 2866-2870, Nov, 2013 Peer-reviewed
Extraction of the Effect and the Technology Terms from a Patent Document

Hirofumi Nonaka, Akio Kobayashi, Hiroki Sakaji, Yusuke Suzuki, Hiroyuki Sakai, Shigeru Masuyama

Journal of Japan Industrial Management Associastion, 63 105-111, 2012 Peer-reviewed
A Polarity Assignment Method to Basis Expressions of Economic Trends Extracted from Economic Newspaper Articles

TANIGUCHI Shota, SAKAJI Hiroki, SAKAI Hiroyuki, MASUYAMA Shigeru

The IEICE transactions on information and systems (Japanese edetion), 94(6) 1039-1043, Jun, 2011 Peer-reviewed
Cross-Bootstrapping: An Automatic Extraction Method of Solution-Effect Expressions from Patent Documents

SAKAJI Hiroki, NONAKA Hirofumi, SAKAI Hiroyuki, MASUYAMA Shigeru

The IEICE transactions on information and systems (Japanese edetion), 93(6) 742-755, Jun, 2010 Peer-reviewed
An automatic thesaurus construction method for technological terms in patent maps

Akio Kobayashi, Hirofumi Nonaka, Shigeru Masuyama, Hiroyuki Sakai

40th International Conference on Computers and Industrial Engineering: Soft Computing Techniques for Advanced Manufacturing and Service Systems, CIE40 2010, 2010 Peer-reviewed

A patent map, a visual representation of related patent information, is an effective strategic tool for analysis of patent application trends. In particular, an effect-technology type patent map is commonly used because patent examinations are based on technology and effect terms of the inventions. However, the patent map creation is a costly task, because most of patent maps are manually created. Therefore, we propose a method for automatic patent map construction. As a first step to develop such a method, we focused on technological terms that represent technologies for realizing the effect. In this step, we develop our method for clustering technological terms, because we need to use clusters of technological terms for our patent maps. The automatic technological terms clustering requires determining the relationship between technological terms. We developed a method to find relationship between technological terms by using a similarity between morphemes of these words as well as existing thesaurus knowledge.
Extraction of correspondences between questions and answers from web message boards

Yusuke Suzuki, Hayato Yokota, Hiroyuki Sakai, Shigeru Masuyama

Transactions of the Japanese Society for Artificial Intelligence, 25(1) 168-173, 2010 Peer-reviewed

We propose a method to extract a lot of correspondences between questions and answers from a Web message board automatically. We use Web message boards as information sources because Web messasge boards have a lot of articles posted by general users. We extract correspondences between questions and answers that can be used in question answering systems to support natural language sentence input. At first, our proposed method classifies messages of a Web message board into either questions or others. Next, our method extracts a set of root-node pairs from the thread tree of a Web message board, where we define the thread tree when the root is an article classified as a question, and nodes are articles classified as answer candidates. Our method finds correspondences between questions and answers using two clues, (1)similarity between their articles, (2)link count between their articles. We experimented the proposed method, discussed results, and analyzed errors.
Assigning Polarity to Causal Information in Financial Articles on Business Performance of Companies

Hiroyuki Sakai, Shigeru Masuyama

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E92D(12) 2341-2350, Dec, 2009 Peer-reviewed

We propose a method of assigning polarity to causal information extracted from Japanese financial articles concerning business performance of companies. Our method assigns polarity (positive or negative) to causal information in accordance with business performance, e.g. "zidousya no uriage ga koutyou: (Sales of cars are good)" (The polarity positive is assigned in this example). We may use causal expressions assigned polarity by our method, e.g., to analyze content of articles concerning business performance circumstantially. First, our method classifies articles concerning business performance into positive articles and negative articles. Using them, our method assigns polarity (positive or negative) to causal information extracted from the set of articles concerning business performance. Although Our method needs training dataset for classifying articles concerning business performance into positive and negative ones. our method does not need a training dataset for assigning polarity to causal information. Hence. even if causal information not appearing in the training dataset for classifying articles concerning business performance into positive and negative ones exist, our method is able to assign it polarity by using statistical information of this classified sets of articles. We evaluated our method and confirmed that it attained 74.4% precision and 50.4% recall of assigning polarity positive, and 76.8% precision and 61.5% recall of assigning polarity negative, respectively.
Extraction of Information on the Technical Effect from a Patent Document

Sakai Hiroyuki, Nonaka Hirohumi, Masuyama Shigeru

Transactions of the Japanese Society for Artificial Intelligence, 24(6) 531-540, 2009

We propose a method for extracting information on the technical effect from a patent document. The information on the technical effect extracted by our method is useful for generating patent maps (see e.g., Figure 1.) automatically or analyzing the technical trend from patent documents. Our method extracts expressions containing the information on the technical effect by using frequent expressions and clue expressions effective for extracting them. The frequent expressions and clue expressions are extracted by using statistical information and initial clue expressions automatically. Our method extracts expressions containing the information on the technical effect without predetermined patterns given by hand, and is expected to be applied to other tasks for acquiring expressions that have a particular meaning (e.g., information on the means for solving the problems) not limited to the information on the technical effect. Our method achieves not only high precision (78.0%) but also high recall (77.6%) by acquiring such clue expressions automatically from patent documents.
Polarity Assignment to Causal Information Extracted from Financial Articles Concerning Business Performance of Companies

Hiroyuki Sakai, Shigeru Masuyama

RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXV, 307-320, 2009 Peer-reviewed

We propose a method of assigning polarity to causal information extracted from Japanese financial articles concerning business performance of companies. Our method assigns polarity (positive or negative) according to business performance to causal information, e.g. "zidousya no uriage ga koutyou: (Sales of cars are good)" (The polarity positive is assigned in this example.). First, our method classifies articles concerning business performance into positive articles and negative articles. Using this classified sets of articles, our method assigns polarity (positive or negative) to causal information extracted from the set of articles concerning business performance. We evaluated our method and it attained 75.3% precision and 47.9% recall of assigning polarity positive, and 77.0% precision and 58.5% recall of assigning polarity negative, respectively.
テキストマイニングを用いた判例文書の分類および情報抽出

野中尋史, 酒井浩之, 増山繁

情報ネットワークローレビュー, 8 74-85, 2009 Peer-reviewed
An informative DOM subtree indentification method from web pages in unfamiliar web sites

Masanobu Tsurutata, Hiroyuki Sakai, Shigeru Masuyama

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E91D(4) 986-989, Apr, 2008 Peer-reviewed

We propose a method of informative DOM* subtree identification from a Web page in an unfamiliar Web site. Our method uses layout data of DOM nodes generated by a generic Web browser. The results show that our method outperforms a baseline method, and was able to identify informative DOM subtrees from Web pages robustly.
Cause information extraction from financial articles concerning business performance

Hiroyuki Sakai, Shigeru Masuyama

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, E91D(4) 959-968, Apr, 2008 Peer-reviewed

We propose a method of extracting cause information from Japanese financial articles concerning business performance. Our method acquires cause information, e.g. (zidousya no uriage ga koutyou: Sales of cars were good)". Cause information is useful for investors in selecting companies to invest. Our method extracts cause information as a form of causal expression by using statistical information and initial clue expressions automatically. Our method can extract causal expressions without predetermined patterns or complex rules given by hand, and is expected to be applied to other tasks for acquiring phrases that have a particular meaning not limited to cause information. We compared our method with our previous one originally proposed for extracting phrases concerning traffic accident causes and experimental results showed that our new method outperforms our previous one.
Automatic extraction of basis expressions that indicate economic trends

Hiroki Sakaji, Hiroyuki Sakai, Shigeru Masuyama

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 5012 977-984, 2008 Peer-reviewed

This paper proposes a method to automatically extract basis expressions that indicate economic trends from newspaper articles by using a statistical method. We also propose a method to classify them into positive expressions that indicate upbeat, and negative expressions that indicate downturn in economy, respectively. It is important for companies, governments and investors to predict economic trends in order to forecast revenue, sales of products, prices of commodities and stock prices. We considered that basis expressions are useful for the companies, governments and investors to forecast economic trends. We extracted basis expressions, and classified them into positive expressions or negative expressions as information to forecast economic trends. Our method used a bootstrap method that was minimally a supervised algorithm for extracting basis expressions. Moreover, our method classified basis expressions into positive expressions or negative ones without dictionaries.
Extraction of cause information from newspaper articles concerning business performance

Hiroyuki Sakai, Shigeru Masuyama

ARTIFICIAL INTELLIGENCE AND INNOVATIONS 2007: FROM THEORY TO APPLICATIONS, 205-+, 2007 Peer-reviewed

We propose a method of extracting cause information from Japanese newspaper articles concerning business performance. Cause information is useful for investors in selecting companies to invest. Our method extracts cause information as a form of causal expression by using statistical information and initial clue phrases automatically. Our method can extract causal expressions without predetermined patterns or complex rules given by hand, and is expected to be applied to other tasks or language for acquiring phrases that have a particular meaning not limited to cause information. We compared our method with our previous method originally proposed for extracting phrases concerning traffic accident causes and experimental results showed that our new method outperforms our previous one.
Research on Automatic Generation of Presentation Slides from a LATEX Manuscript of a Paper

MIYAMOTO Masato, SAKAI Hiroyuki, MASUYAMA Shigeru

Journal of Japan Society for Fuzzy Theory and Intelligent Informatics, 18(5) 752-760, Oct, 2006 Peer-reviewed

At the presentation of his/her research results, it is necessary and indispensable for a researcher to prepare presentation slides for making audience understood his/her research results within limited time. However, making slides requires a lot of time and much effort. Therefore, many researchers hope to prepare slides more efficiently. In this research, we propose a method for automatically generating slides from a LATEX manuscript of a paper, which aims at reduction of researchers' workload. Our method analyzes a LATEX file of a paper and allocates content contained in the paper to slides and generates itemization. In the analysis of a LATEX file, our method uses only necessary information for slide generation and deletes unnecessary information. Here, our method determines necessary information for slide generation by using a typical structure of the LATEX file. In the allocation of content to slides, our method calculates the score of nouns based on frequency, entropy and idf score and extracts importance sentences contained in the paper by using the score and allocates the extracted important sentences to the slides. In generation of itemization, our method generates itemization by using the conjunction that signifies a parallel relation. The reason why our method generates itemization by using the conjunction is as follows: A sentence corresponding to a sentence including the conjunction that signifies a parallel relation may be contained in the paper. Hence, our method generates itemization that consists of the sentence including the conjunction and the sentence corresponding to it. We evaluated our method, and the experimental results showed that our method turned out to be e?ective for generating slides from a LATEX file of a paper.
Extraction of Expressions concerning Accident Cause contained in Articles on Traffic Accidents

SAKAI HIROYUKI, UMEMURA SHOUJI, MASUYAMA SHIGERU

Journal of natural language processing, 13(4) 99-124, 2006 Peer-reviewed
A Multiple-Document Summarization System Introducing User Interaction for Reflecting User's Summarization Needs

SAKAI Hiroyuki, MASUYAMA Shigeru

Journal of Japan Society for Fuzzy Theory and Intelligent Informatics, 18(2) 265-279, 2006

We propose a multiple-document summarization system with user interaction for coping appropriately with the user's varying summarization needs. Generally, automatic document summarization is a technology for producing a summary corresponding to a single document. However, in order that a person can better perform intellectual activities, a technology for producing a summary of more than one document (i.e. multiple-document summarization) becomes more important than producing a summary of a single document. Our multiple-document summarization system extracts keywords from the document set to be summarized and displays the k best keywords scored by our system to a user on the screen. From the displayed keywords, the user selects those reflecting the user's summarization needs. Here, in this paper, we define a user's "summarization needs" as content in which that particular user is interested. Our multiple-document summarization system produces a summary suitable for a user's summarization needs by using user-selected keywords. For evaluation of our method, we participated in the TSC3 of NTCIR4, an evaluation workshop for information retrieval and summarization held by National Institute of Informatics. We participated in this workshop by having our system select the 12 best keywords as scored by our system, and our entered system exhibited good performance in content evaluation of multiple-document summarization task. Moreover, we evaluate the effectiveness of user interaction, and the experimental results show that our user interaction system is effective.
可読性の向上を目的とした片仮名表記外来語の換言知識獲得

吉田辰巳, 遠間雄二, 増山繁, 酒井浩之

電子情報通信学会論文誌D-II, J88-D-II 1237-1245, 2005 Peer-reviewed
略語とその原型語との対応関係のコーパスからの自動獲得手法の改良

酒井浩之, 増山繁

自然言語処理, 12(4) 207-231, 2005 Peer-reviewed
Knowledge Acquisition about the Deletion Possibility of Adnominal Verb Phrases

SAKAI Hiroyuki, MASUYAMA Shigeru

The transactions of the Institute of Electronics, Information and Communication Engineers. D-II, 87(8) 1641-1652, Aug, 2004 Peer-reviewed
動詞連体修飾節の省略可能性に関するコーパスからの知識獲得

電子情報通信学会論文誌D-II, vol.J87-D-II, no.8, pp.1641--1652, 2004
A statistical method for acquiring knowledge about the abbreviation possibility of some of multiple adnomial phrases

IEICE Transactions on Information and Systems, vol.E86-D, no.9, pp.1710-1718, 2003 Peer-reviewed
Knowledge Acquisition of Relation between Abbreviations and Their Original Words

SAKAI Hiroyuki, MASUYAMA Shigeru

The transactions of the Institute of Electronics, Information and Communication Engineers. D-II, 85(10) 1624-1628, Oct, 2002 Peer-reviewed
Knowledge Acquisition about the Abbreviation Possibility of Verb Phrases

SAKAI HIROYUKI, SHINOHARA NAOTSUGU, MASUYAMA SHIGERU, YAMAMOTO KAZUHIDE

Journal of Natural Language Processing, 9(3) 41-62, Jul, 2002 Peer-reviewed

This paper proposes a method of acquiring knowledge about the abbreviation possibility of verb phrases. In a certain clause containing a verb and including verb phrases, the proposed method extracts some clauses which contain the same verb and have different case postpositional particles from a large corpus. Then, our method recognizes verb phrases possible to be abbreviated by comparing the verb phrases with the verb phrases contained in the extracted clauses. In our method, the verb phrases containing important piece of information is hard to recognize as being possible to be abbreviated, and the verb phrases containing information which appear in previous sentences is easy to recognize as being possible to be abbreviated. The evaluation of our method by experiments shows that the precision is 78.0% and the recall is 67.9%.We compare our method with the method which recognizes verb phrases possible to be abbreviated by recognizing optional case elements described in a case frame dictionary as being possible to be abbreviated. By the evaluation results, we conclude that our method outperforms the method which recognizes verb phrases possible to be abbreviated by using a case frame dictionary.

Misc.

金融・経済ドメインを対象とした言語処理

坂地泰紀, 和泉潔, 酒井浩之

自然言語処理 = Journal of natural language processing, 27(4) 951-955, Dec, 2020
Extraction of causal and complementary information for generating market analysis comments by automatic generation of training data

酒井浩之, 坂地泰紀, 和泉潔, 松井藤五郎, 入江圭太郎

人工知能学会全国大会(Web), 34th, 2020
Determination of unexpectedness of cause-result expressions extracted from summary of financial statements

117(207) 93-98, Sep 7, 2017
Extraction of causal information from summaries of financial statements

117(207) 87-91, Sep 7, 2017
都道府県議会会議録からの意見や意志を表す発言の抽出

坂地泰紀, 酒井浩之, 小林暁雄, 内田ゆず, 乙武北斗, 高丸圭一, 木村泰知

言語処理学会第23回年次大会(NLP2017), 426-429, Mar, 2017

Presentations

Extraction of sentences concerning business performance forecast and economic forecast from summaries of financial statements by deep learning

Shiori Kitamori, Hiroyuki Sakai, Hiroki Sakaji

IEEE Symposium on Computational Intelligence for Financial Engineering & Economics, Nov, 2017
Discovery of Rare Causal Knowledge from Financial Statement Summaries

Hiroki Sakaji, Risa Murono, Hiroyuki Sakai, Jason Bennett, Kiyoshi Izumi

IEEE Symposium on Computational Intelligence for Financial Engineering & Economics, Nov, 2017
Extracting Laboratory Front Pages from University Websites

Hiroki Sakaji, Atsuya Miyazaki, Hiroyuki Sakai, Kiyoshi Izumi

he 6th International Workshop on Web Services and Social Media In conjunction with The 20th International Conference on Network-Based Information Systems, Aug, 2017
Estimation of Tags via Comments on Nico Nico Douga

Hiroki Sakaji, Masaki Kohana, Akio Kobayashi, Hiroyuki Sakai

The 5th International Workshop on Web Services and Social Media In conjunction with the 19th International Conference on NBiS-2016, Sep, 2016
企業の決算短信ＰＤＦから抽出した業績要因への極性付与

酒井浩之, 小林義和, 坂地泰紀

第15回人工知能学会金融情報学研究会, Sep 26, 2015

Teaching Experience

Professional Memberships

Research Projects

Company search system for employment support by using financial statements and web information

Grants-in-Aid for Scientific Research, Japan Society for the Promotion of Science, 2015 - 2018

Sakai Hiroyuki
Automatic Generation of Patentmaps from Patent Documents by Extraction of Expressions using Bootstrapping

Grants-in-Aid for Scientific Research, Japan Society for the Promotion of Science, 2010 - 2012

MASUYAMA Shigeru, SAKAI Hiroyuki
Development of the method of extracting causal expressions from Japanese financial articles concerning business performance and analyzing them for supporting decision on investment

Grants-in-Aid for Scientific Research(若手研究(B)), Ministry of Education, Culture, Sports, Science and Technology, 2009 - 2011

Hiroyuki SAKAI
Simultaneous interpreting system based on segmentation, translation and connection of spoken sentences

Grants-in-Aid for Scientific Research, Japan Society for the Promotion of Science, 2008 - 2010

INAGAKI Yasuyoshi, MATSUBARA Shigeki
Research on the use of information by using large-scale information sources

The Other Research Programs, 2005 - 2010

To the list screen