11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 442-445, 2010 Peer-reviewed
A variety of methods for audio-visual integration, which combine audio and visual information at the level of features, states, or classifier outputs, have been proposed for robust speech recognition. However, these methods do not always fully utilize auditory information when the signal-to-noise ratio becomes low. In this paper, we propose a novel approach to estimating speech signals in noisy environments. The key idea behind this approach is to exploit clean speech candidates generated by using the timing structure between mouth movements and sound signals. We first extract a pair of feature sequences from the media signals and segment each sequence into temporal intervals. Then, we construct a cross-media timing-structure model of human speech by learning the temporal relations of overlapping intervals. Based on the learned model, we generate clean speech candidates from the observed mouth movements.
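The candidate-generation idea in this abstract can be sketched as follows. This is a minimal illustration only, not the paper's implementation: the lag statistics, the helper name `candidate_onsets`, and the idea of proposing onsets at mean-lag plus sigma offsets are all assumptions made for the sketch.

```python
import statistics

# "Learned" timing model (illustrative numbers): lags observed during
# training between a mouth-motion onset and the corresponding speech onset.
training_lags = [0.10, 0.12, 0.11, 0.13]
mu = statistics.mean(training_lags)     # mean lag, in seconds
sigma = statistics.stdev(training_lags) # spread of the lag

def candidate_onsets(mouth_onset: float, n: int = 3) -> list[float]:
    """Propose candidate speech onsets around the learned mean lag."""
    return [mouth_onset + mu + k * sigma for k in range(-(n // 2), n // 2 + 1)]

# From an observed mouth-motion onset, generate clean-speech timing candidates.
cands = candidate_onsets(mouth_onset=2.0)
```

A full system would score each candidate against the noisy audio; the sketch only shows how a learned cross-media timing model turns visual events into speech-timing hypotheses.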
Proceedings - 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops, ACII 2009, 201-208, 2009 Peer-reviewed
Conference on Human Factors in Computing Systems - Proceedings, 3585-3590, 2008 Peer-reviewed
Turn-taking in a smooth conversation is supported by participants' anticipation of the floor-handover timing. However, it becomes difficult to maintain natural turn-taking in video conferencing with transmission delays, because the utterances and movements of each participant are presented to the others with a time lag, which often leads to collisions of utterances. In order to facilitate smooth communication over a video-conferencing system, we propose a novel method, "Visual Filler," that fills temporal gaps in turn-taking caused by delays. Visual Filler overlays an artificial visual stimulus, which has a function similar to that of filler sounds, on a screen with participant images. We have evaluated the effectiveness of Visual Filler in reducing the unnaturalness of turn-taking in a simulated dyadic dialog situation with a delay.
ARTICULATED MOTION AND DEFORMABLE OBJECTS, PROCEEDINGS, 4069 453-463, 2006 Peer-reviewed
Modeling and describing temporal structure in multimedia signals, which are captured simultaneously by multiple sensors, is important for realizing human-machine interaction and motion generation. This paper proposes a method for modeling temporal structure in multimedia signals based on temporal intervals of primitive signal patterns. Using the temporal difference between the beginning points and that between the ending points of the intervals, we can explicitly express timing structure, that is, synchronization and mutual dependency among media signals. We applied the model to video signal generation from an audio signal to verify its effectiveness.
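The interval-pair representation described above can be sketched in a few lines. This is a minimal illustration under assumed names (`Interval`, `timing_differences`) and made-up timings; it is not the paper's code.

```python
from dataclasses import dataclass

@dataclass
class Interval:
    """A temporal interval of a primitive signal pattern, in seconds."""
    begin: float
    end: float

def timing_differences(a: Interval, b: Interval) -> tuple[float, float]:
    """Return (begin difference, end difference) between two intervals.

    The pair explicitly encodes timing structure: (0, 0) means the two
    media patterns are fully synchronized, while nonzero values quantify
    how much one pattern leads or lags the other.
    """
    return (b.begin - a.begin, b.end - a.end)

# Example: a mouth-motion interval and an overlapping audio interval.
mouth = Interval(begin=0.00, end=0.80)
audio = Interval(begin=0.12, end=0.85)
db, de = timing_differences(mouth, audio)  # audio starts and ends slightly late
```

Collecting such difference pairs over many overlapping intervals gives the distribution from which synchronization and mutual dependency can be modeled.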
IEICE Transactions on Fundamentals, E88-A(11) 3022-3035, Nov, 2005 Peer-reviewed
This paper addresses the parameter estimation problem of an interval-based hybrid dynamical system (interval system). The interval system has a two-layer architecture that comprises a finite state automaton and multiple linear dynamical systems. The automaton controls the activation timing of the dynamical systems based on a stochastic transition model between intervals. Thus, the interval system can generate and analyze complex multivariate sequences that consist of temporal regimes of dynamic primitives. Although the interval system is a powerful model for representing human behaviors such as gestures and facial expressions, the learning process has a paradoxical nature: temporal segmentation of primitives and identification of the constituent dynamical systems need to be solved simultaneously. To overcome this problem, we propose a multiphase parameter estimation method that consists of a bottom-up clustering phase of linear dynamical systems and a refinement phase of all the system parameters. Experimental results show that the method can organize the hidden dynamical systems behind the training data and refine the system parameters successfully.
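The two-layer architecture described above can be sketched as follows. This is a minimal generative illustration with assumed, scalar dynamics and a deterministic automaton standing in for the stochastic transition model; it is not the paper's estimation method.

```python
# Two scalar linear dynamical systems x_{t+1} = a*x + b (illustrative
# parameters), activated in alternation by a finite-state automaton.
dynamics = {
    "rise": (0.9, 0.5),   # x converges toward +5.0
    "fall": (0.9, -0.5),  # x converges toward -5.0
}
transitions = {"rise": "fall", "fall": "rise"}  # deterministic, for brevity

def generate(n_intervals: int, duration: int = 20) -> list[float]:
    """Generate a sequence as a concatenation of temporal regimes."""
    state, x, seq = "rise", 0.0, []
    for _ in range(n_intervals):
        for _ in range(duration):
            a, b = dynamics[state]
            x = a * x + b           # the dynamical system active in this interval
            seq.append(x)
        state = transitions[state]  # the automaton switches the active system
    return seq

seq = generate(n_intervals=4)
```

The learning problem the paper addresses is the inverse of this sketch: given only `seq`, recover both the interval boundaries and the `(a, b)` parameters of each regime, which is why segmentation and system identification must be solved together.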
3rd International Conference on Advances in Pattern Recognition (S. Singh et al. Eds.: ICAPR 2005 Springer LNCS 3686), 229-238, Aug, 2005 Peer-reviewed
ANALYSIS AND MODELLING OF FACES AND GESTURES, PROCEEDINGS, 3723 140-154, 2005 Peer-reviewed
This paper presents a method for interpreting facial expressions based on temporal structures among partial movements in facial image sequences. To extract the structures, we propose a novel facial expression representation, which we call a facial score, similar to a musical score. The facial score enables us to describe facial expressions as spatio-temporal combinations of temporal intervals; each interval represents a simple motion pattern with the beginning and ending times of the motion. Thus, we can classify fine-grained expressions from multivariate distributions of the temporal differences between the intervals in the score. In this paper, we provide a method to obtain the score automatically from input images using bottom-up clustering of dynamics. We evaluate the effectiveness of facial scores by comparing the temporal structure of intentional smiles with that of spontaneous smiles.
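The facial-score representation can be sketched as a mapping from facial parts to lists of motion intervals. The structure, part names, and timings below are illustrative assumptions, not data or code from the paper.

```python
# A toy "facial score": each facial part has (begin, end) intervals, in
# seconds, of its primitive motion patterns (illustrative values).
facial_score = {
    "mouth":  [(0.00, 0.60)],
    "eyelid": [(0.15, 0.55)],
}

def onset_lag(score: dict, part_a: str, part_b: str, k: int = 0) -> float:
    """Temporal difference between the beginning times of the k-th
    intervals of two facial parts."""
    return score[part_b][k][0] - score[part_a][k][0]

# One feature of the expression's temporal structure: how long after the
# mouth motion the eyelid motion begins.
lag = onset_lag(facial_score, "mouth", "eyelid")
```

Classifying expressions then amounts to comparing the multivariate distribution of such lags, for example between intentional and spontaneous smiles.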
Systems and Computers in Japan, 34(14) 1-12, Dec, 2003 Peer-reviewed
This paper proposes a system architecture for event recognition that dynamically integrates information from multiple sources (e.g., multimodal data from visual and auditory sensors). The proposed system consists of multiple event classifiers called Continuous State Machines (CSMs). Each CSM has a state transition rule in a continuous state space and classifies time-varying patterns from a different single source. Since the rule is defined as an extension of the Kalman filter (i.e., the next state is deduced from a trade-off between the input data and the model's prediction), CSMs support dynamic time warping and robustness against noise. We then introduce an interaction method among CSMs to classify events from multiple sources. A continuous state space (i.e., a vector space) allows us to design the interaction as minimization of an energy function. This interaction enables the system to dynamically suppress unreliable classifiers, and improves the system's reliability and the accuracy of classifying events in dynamically changing situations (e.g., when the object is temporarily occluded from one of multiple cameras in a gesture recognition task). Experimental results on gesture recognition with two cameras show the effectiveness of the proposed system.
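The Kalman-style trade-off rule underlying a CSM can be sketched in the scalar case. The function name, parameters, and values below are illustrative assumptions; the actual CSM operates in a multivariate state space.

```python
def csm_step(x: float, y: float, a: float = 1.0, gain: float = 0.3) -> float:
    """One state update: blend the model's prediction a*x with the
    observation y.

    gain = 0 trusts the model only; gain = 1 trusts the input only.
    Lowering the gain for one classifier is how interaction can suppress
    an unreliable input stream (e.g., an occluded camera).
    """
    pred = a * x
    return pred + gain * (y - pred)

# Track a constant observed feature: the state approaches the observation
# at a rate set by the gain.
x = 0.0
for y in [1.0, 1.0, 1.0]:
    x = csm_step(x, y)
```

The trade-off form makes the noise-robustness claim concrete: a noisy sample moves the state only by `gain` times the innovation, while repeated consistent evidence still pulls the state toward the observation.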
16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2 785-789, 2002 Peer-reviewed
This paper proposes a system architecture for event recognition that integrates information from multiple sources (e.g., gesture and speech recognition from distributed sensors in the real world). The proposed system consists of multiple recognizers named Continuous State Machines (CSMs). Each CSM has a state transition rule in a continuous state space and classifies time-varying patterns from a single source. Since the rule is defined as a simplification of the Kalman filter (i.e., the next state is deduced from a trade-off between the input data and the model's prediction), CSMs support dynamic time warping and robustness against noise. We then introduce an interaction method among CSMs to classify events from multiple sources. A continuous state space (i.e., a vector space) allows us to design the interaction as recursive minimization of an energy function. This interaction enables the system to dynamically shift its focus over the multiple sources, and improves the reliability and accuracy of classifying events in dynamically changing situations (e.g., when the object is temporarily occluded from one of multiple cameras in a gesture recognition task). Experimental results on gesture recognition with two cameras show the effectiveness of the proposed system.
P. Benner, R. Findeisen, D. Flockerzi, U. Reichl, K. Sundmacher (Role: Contributor, Chap.3, Magnus Egerstedt, Jean-Pierre de la Croix, Hiroaki Kawashima, and Peter Kingston, "Interacting with Networks of Mobile Agents")
Grants-in-Aid for Scientific Research Grant-in-Aid for Transformative Research Areas (A), Japan Society for the Promotion of Science, Sep, 2021 - Mar, 2026