研究者業績

基本情報

所属
武蔵野大学 データサイエンス学部 データサイエンス学科 教授
学位
Ph.D.(Sirindhorn International Institute of Technology, Thailand)

通称等の別名
Kaeng-SOM
J-GLOBAL ID
201901012821607047
researchmap会員ID
B000350703

In the age that information blows into us anywhere and anytime, it enables people to access across various sciences easily. The combination of science, arts and humanity can occur serenely. I believe that science and mind are compatible. My deep passion is to apply various academic aspects including truth together for the benefit and highest happiness of mankind. I therefore became a part of many important multidisciplinary project such as the Multi-lingual Machine Translation Project (1989-1995), Language Resource Development for Computer Science (ORCHID), E-Learning for Minor Languages, Information Literacy, Application of IOT for the elderly and Well-Being, Digital Cultural Information Development (Digitized Thailand), Digital Thinking, Social Entrepreneur, University Entrepreneur, Start up Promotion Support and so on.

I have an M.A. and B.A. in Arts and Linguistics and Ph.D. in Information Technology. All degrees and experiences from the real implemented projects granted me as part of a member of the National Electronics and Computer Technology Center, Thailand (1994-2014), Faculty of Informatics, Burapha University of Thailand (2014-2019). Currently, I am a member of the Faculty of Data Science, and Asia AI Institute of Musashino University, Japan.

My research interests include Language Resource Development, Semantics, Syntactic, Morphological Analysis, Machine Translation, Language Intermediate Representation, Natural Language Processing, Social Innovation, Business Model and Inspiration, Social Understanding,  Standardization on Heritage Information, and Cultural and Historic Digitization.

論文

 38
  • Virach Sornlertlamvanich, Thatsanee Charoenporn, Somrudee Deepaisarn
    Frontiers in Artificial Intelligence and Applications 2024年1月16日  
    The Thammasat AI City distributed platform is a proposed AI platform designed to enhance city intelligent management. It addresses the limitations of current smart city architecture by incorporating cross-domain data connectivity and machine learning to support comprehensive data collection. In this study, we delve into two main areas, that is, monitoring and visualization of city ambient lighting, and indoor human physical distance tracking. The smart street light monitoring system provides real-time visualization of street lighting status, energy consumption, and maintenance requirement, which helps to optimize energy consumption and maintenance reduction. The indoor camera-based system for human physical distance tracking can be used in public spaces to monitor social distancing and ensure public safety. The overall goal of the platform is to improve the quality of life in urban areas and align with sustainable urban development concepts.
  • Virach Sornlertlamvanich, Pawinee Iamtrakul, Teerayuth Horanont, Narit Hnoohom, Konlakorn Wongpatikaseree, Sumeth Yuenyong, Jantima Angkapanichkit, Suthasinee Piyapasuntra, Prittipoen Lopkerd, Santirak Prasertsuk, Chawee Busayarat, I-soon Raungratanaamporn, Somrudee Deepaisarn, Thatsanee Charoenporn
    Frontiers in Artificial Intelligence and Applications 2023年1月23日  
    This research proposes an AI platform for data sharing across multiple domains. Since the data in the smart city concept are domain-specific processed, the existing smart city architecture is suffered from cross-domain data interpretation. To go beyond the digital transformation efforts in smart city development, the AI city is created on the architecture of cross-domain data connectivity and transform learning in the machine learning paradigm. In this research, the health and human behavioral data are targeted on human traceability and contactless technologies. To measure the inhabitants quality of life (QoL), the primary emotion expression study is conducted to interpret the emotional states and the mental health of people in the urbanized city. The results of information augmentation draw attention to the immersive visualization of the Thammasat model.
  • Taiki Kimura, Thatsanee Charoenporn, Virach Sornlertlamvanich
    2022 International Electronics Symposium (IES) 2022年8月9日  
  • Virach Sornlertlamvanich, Thatsanee Charoenporn, Shiori Sasaki, Yasushi Kiyoki
    IIAI-AAI-Winter 225-229 2022年  
  • Virach Sornlertlamvanich, Kitiya Suriyachay, Thatsanee Charoenporn
    Human Language Technology. Challenges for Computer Science and Linguistics 143-160 2022年  
  • Goragod PONGTHANISORN, Waranrach VIRIYAVIT, Thatsanee CHAROENPORN, Virach SORNLERTLAMVANICH
    2021年9月  査読有り
  • Kitiya Suriyachay, Thatsanee Charoenporn, Virach Sornlertlamvanich, Natsuda Kaothanthong
    Journal of Science & Technology Asia (STA) 26(2) 61-78 2021年7月  査読有り
  • Virach Sornlertlamvanich, Nannam Aksorn, Thatsanee Charoenporn
    International Conference Language Technologies for All (LT4All) 2019年12月  査読有り最終著者
  • Thatsanee Charoenporn, Virach Sornlertlamvanich
    International Conference Language Technologies for All (LT4All) 2019年12月  査読有り筆頭著者
  • Ari Yanase, Virach Sornlertlamvanich, Thatsanee Charoenporn
    International Conference Language Technologies for All (LT4All) 2019年12月  査読有り最終著者
  • Canasai Kruengkrai, Virach Sornlertlamvanich, Watchira Buranasing, Thatsanee Charoenporn
    Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language Processing(WSSANLP@COLING) 15-24 2012年  査読有り
  • Canasai Kruengkrai, Thatsanee Charoenporn, Virach Sornlertlamvanich
    Proceedings of the 3rd Named Entities Workshop(NEWS@IJCNLP) 28-31 2011年  査読有り
  • Sineenat Tienkouw, Nutvadee Wongtosrad, Paramet Tanwanont, Rattapoom Niraswan, Chaimongkon Khlayprapha, Pisan Taesuwan, Thanate Muangthong, Sapa Chanyachatchawan, Rattapoom Tuchinda, Thatsanee Charoenporn, Virach Sornlertlamvanich
    PICMET: Portland International Center for Management of Engineering and Technology, Proceedings 2011年  査読有り
    In this paper, we present a strategic marketing plan combined with Technology-Based Marketing approach (TBM) and Competitive Strategy approach for an intelligent travel planning system named Pi-Pe which uses artificial intelligence to help users easily create their one-day trip schedule with three simple steps: (1) Select the date and time (2) State the starting location and (3) Pick the destinations. The purpose of the paper is to develop a set of technology-based marketing plan for Pi-Pe in the pursuit of competitive advantage to drive up product value and create sustained commercial advantage. © 2011 IEEE.
  • Virach Sornlertlamvanich, Thatsanee Charoenporn
    Proceedings - 2011 2nd International Conference on Culture and Computing, Culture and Computing 2011 92-97 2011年  査読有り
    Since 2006, by the Ministry of Culture, there was an effort in creating cultural portal. Each provincial cultural office was assigned to survey and report any cultural practice according to a designed template. The technological potential of each office was so different and most of them had to relied on the local system developer and service provider. As a result, the collected contents cannot fulfill the requirement of documentation standard and service level of media presentation. In contrast, cultural knowledge is redefined to an agreed structure for interoperability and computability. Moreover, the contents must reflect the up-to-date daily practice and thoroughly be covered by any individuals. We reuse the existing contents by recovering any typos and format distortion. The social networking system is prepared to allow individual participation to co-create the contents under an authorized supervision. As a result, more than 80 percents of the original contents have been recovered. Each provincial office is able to strategically plan and co-create their own contents to establish the total cultural knowledge. © 2011 IEEE.
  • Kergrit Robkop, Sareewan Thoongsup, Thatsanee Charoenporn, Virach Sornlertlamvanich, Hitoshi Isahara
    Global Wordnet Conference, GWC 2010 2010年  
    This paper deals with the development of a platform for Asian WordNet (AWN) construction. Not only for the diversity of the languages using in Asia, we also need a platform that can connect the distributedly developing WordNet to establish a network for the cross language WordNet. Each WordNet is created independently by referring to the original Princeton WordNet (PWN) as the focal representation. The Asian WordNet Management System (WNMS) is proposed as a distributed management system that allows the server for each WordNet interchange requests with each other to perform a cross language WordNet interfacing, including the fundamental web service utilities for editing, visualizing and exporting.
  • Virach Sornlertlamvanich, Thatsanee Charoenporn, Hitoshi Isahara
    Proceedings of the 7th International Conference on Language Resources and Evaluation, LREC 2010 514-517 2010年  査読有り
    This paper presents the language resource management system for the development and dissemination of Asian WordNet (AWN) and its web service application. We develop the platform to establish a network for the cross language WordNet development. Each node of the network is designed for maintaining the WordNet for a language. Via the table that maps between each language WordNet and the Princeton WordNet (PWN), the Asian WordNet is realized to visualize the cross language WordNet between the Asian languages. We propose a language resource management system, called WordNet Management System (WNMS), as a distributed management system that allows the server to perform the cross language WordNet retrieval, including the fundamental web service applications for editing, visualizing and language processing. The WNMS is implemented on a web service protocol therefore each node can be independently maintained, and the service of each language WordNet can be called directly through the web service API. In case of cross language implementation, the synset ID (or synset offset) defined by PWN is used to determined the linkage between the languages.
  • Virach Sornlertlamvanich, Thatsanee Charoenporn, Kergrit Robkop, Chumpol Mokarat, 井佐原 均
    JAPIO 2009 Year Book 2009年9月  査読有り
    アジアワードネットの開発手法を述べた。
  • Takenobu Tokunaga, Dain Kaplan, Nicoletta Calzolari, Monica Monachini, Claudia Soria, Virach Sornlertlamvanich, Thatsanee Charoenporn, Xia YingJu, Chu-Ren Huang, Shu-Kai Hsieh, Kiyoaki Shirai
    pp. 145-152 2009年8月  査読有り
  • Tokunaga Takenobu, Dain Kaplan, Chu Ren Huang, Shu Kai Hsieh, Nicoletta Calzolari, Monica Monachini, Claudia Soria, Shirai Kiyoaki, Virach Sornlertlamvanich, Thatsanee Charoenporn, Xia Yingju
    Proceedings of the 6th International Conference on Language Resources and Evaluation, LREC 2008 1658-1663 2008年  査読有り
    Corpus-based approaches and statistical approaches have been the main stream of natural language processing research for the past two decades. Language resources play a key role in such approaches, but there is an insufficient amount of language resources in many Asian languages. In this situation, standardisation of language resources would be of great help in developing resources in new languages. This paper presents the latest development efforts of our project which aims at creating a common standard for Asian language resources that is compatible with an international standard. In particular, the paper focuses on i) lexical specification and data categories relevant for building multilingual lexical resources for Asian languages; ii) a core upper-layer ontology needed for ensuring multilingual interoperability and iii) the evaluation platform used to test the entire architectural framework.
  • Virach Sornlertlamvanich, Thatsanee Charoenporn, Suphanut Thayaboon,Chumpol Mokarat, Hitoshi Isahara
    Third International Joint Conference on Natural Language Processing, IJCNLP 2008, Hyderabad, India, January 7-12, 2008 105-106 2008年1月  査読有り
    タイラボで開発した言語資源構築用ツールについて述べた。
  • Virach Sornlertlamvanich, Thatsanee Charoenporn, Chumpol Mokarat, Hitoshi Isahara, Hammam Riza, Purev Jaimai
    Third International Joint Conference on Natural Language Processing, IJCNLP 2008, Hyderabad, India, January 7-12, 2008 673-678 2008年1月  査読有り
    We explore an automatic WordNet synset assignment to the bi-lingual dictionaries.
  • Virach Sornlertlamvanich, Thatsanee Charoenporn, Shisanu Tongchim, Canasai Kruengkrai, Hitoshi Isahara
    IEICE Transactions on Information and Systems E90-D(10) 1565-1573 2007年10月  査読有り
    Several approaches have been studied to cope with the exceptional features of non-segmented languages. When there is no explicit information about the boundary of a word, segmenting an input text is a formidable task in language processing. Not only the contemporary word list, but also usages of the words have to be maintained to cover the use in the current texts. The accuracy and efficiency in higher processing do heavily rely on this word boundary identification task. In this paper, we introduce some statistical based approaches to tackle the problem due to the ambiguity in word segmentation. The word boundary identification problem is then defined as a part of others for performing the unified language processing in total. To exhibit the ability in conducting the unified language processing, we selectively study the tasks of language identification, word extraction, and dictionary-less search engine. Copyright © 2007 The Institute of Electronics, Information and Communication Engineers.
  • Thatsanee Charoenporn, Sareewan Thoongsup, Virach Sornlertlamvanich, 井佐原 均
    SEALS XVII Conference 2007年8月  査読有り
    This is a part of the Development of Language Resource Standard for Semantic Web Applications Project.
  • Thatsanee Charoenporn, Canasai Kruengkrai, Thanaruk Theeramunkong, Virach Sornlertlamvanich
    IEICE Transactions on Information and Systems E90-D(4) 775-782 2007年3月  査読有り
    Manually collecting contexts of a target word and grouping them based on their meanings yields a set of word senses but the task is quite tedious. Towards automated lexicography, this paper proposes a word-sense discrimination method based on two modern techniques; EM algorithm and principal component analysis (PCA). The spherical Gaussian EM algorithm enhanced with PCA for robust initialization is proposed to cluster word senses of a target word automatically. Three variants of the algorithm, namely PCA, sGEM, and PCA-sGEM, are investigated using a gold standard dataset of two polysemous words. The clustering result is evaluated using the measures of purity and entropy as well as a more recent measure called normalized mutual information (NMI). The experimental result indicates that the proposed algorithms gain promising performance with regard to discriminate word senses and the PCA-sGEM outperforms the other two methods to some extent. Copyright © 2007 The Institute of Electronics, Information and Communication Engineers.
  • Thatsanee Charoenporn, Virach Sornlertlamvanich, Chumpol Mokarat, Hitoshi Isahara, Hammam Riza, Purev Jaimai
    GWC 2008: 4th Global WordNet Conference, Proceedings 101-110 2007年  査読有り
    This paper describes an automatic WordNet synset assignment to the existing bi-lingual dictionaries of languages having limited lexicon information. Generally, a term in a bi-lingual dictionary is provided with very limited information such as part-of-speech, a set of synonyms, and a set of English equivalents. This type of dictionary is comparatively reliable and can be found in an electronic form from various publishers. In this paper, we propose an algorithm for applying a set of criteria to assign a synset with an appropriate degree of confidence to the existing bi-lingual dictionary. We show the efficiency in nominating the synset candidate by using the most common lexical information. The algorithm is evaluated against the implementation of Thai-English, Indonesian-English, and Mongolian-English bi-lingual dictionaries. The experiment also shows the effectiveness of using the same type of dictionary from different sources. © University of Szeged, Department of Informatics, 2007. All rights are reserved.
  • Virach Sornlertlamvanich, Thatsanee Charoenporn, Kergrit Robkop, Hitoshi Isahara
    GWC 2008: 4th Global WordNet Conference, Proceedings 419-427 2007年  査読有り
    This paper describes a multi-lingual WordNet construction tool, called KUI (Knowledge Unifying Initiator), which is a knowledge user interface for online collaborative knowledge construction. KUI facilitates online community in developing and discussing multi-lingual WordNet. KUI is a sort of social networking system that unifies the various discussions following the process of thinking model, i.e. initiating the topic of interest, collecting the opinions to the selected topics, localizing the opinions through the translation or customization and finally posting for public hearing to conceptualize the knowledge. The process of thinking is done under the selectional preference simulated by voting mechanism in the case that there are many alternatives. By measuring the history of participation of each member, KUI adaptively manages the reliability of each member's opinion and vote according to the estimated ExpertScore. As a result, the multi-lingual WordNet can be created online and produce a reliable result. © University of Szeged, Department of Informatics, 2007. All rights are reserved.
  • Virach Sornlertlamvanich, Thatsanee Charoenporn, Kergrit Robkop, Hitoshi Isahara
    Intercultural Collaboration, First International Workshop, IWIC 2007, Kyoto, Japan, January 25-26, 2007, Invited and Selected Papers 91-102 2007年1月  査読有り
    インターネット上のコラボレーションツールKUIについて報告した。
  • Tokunaga Takenobu, Virach Sornlertlamvanich, Thatsanee Charoenporn, Nicoletta Calzolari, Monica Monachini, Claudia Sonia, Chu-Ren Huang, Xia YingJu, Yu Hao, Laurent Prevot, Shirai Kiyoaki
    Proceedings of the COLING/ACL 2006 827-834 2006年7月  査読有り
  • Thatsanee Charoenporn, Canasai Kruengkrai, Thanaruk Theeramunkong, Virach Sornlertlamvanich
    IEICE Transactions on Information and Systems E89-D(7) 2286-2293 2006年7月  査読有り
    A lexicon is an important linguistic resource needed for both shallow and deep language processing. Currently, there are few machine-readable Thai dictionaries available, and most of them do not satisfy the computational requirements. This paper presents the design of a Thai lexicon named the TCL's Computational Lexicon (TCLLEX) and proposes a method to construct a large-scale Thai lexicon by re-using two existing dictionaries and a large number of texts on the Internet. In addition to morphological, syntactic, semantic case role and logical information in the existing dictionaries, a sort of semantic constraint called selectional preference is automatically acquired by analyzing Thai texts on the web and then added into the lexicon. In the acquisition process of the selectional preferences, the so-called Bayesian Information Criterion (BIC) is applied as the measure in a tree cut model. The experiments are done to verify the feasibility and effectiveness of obtained selection preferences. Copyright © 2006 The Institute of Electronics, Information and Communication Engineers.
  • Thatsanee Charoenporn, Canasai Kruengkrai, Thanaruk Theeramunkong, Virach Sornlertlamvanich, Hitoshi Isahara
    Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2006 949-954 2006年  査読有り
    The growing of multilingual information processing technology has created the need of linguistic resources, especially lexical database. Many attempts were put to alter the traditional dictionary to computational dictionary or widely named as computational lexicon. TCL's Computational Lexicon (TCLLEX) is a recent development of a large-scale Thai Lexicon which aims to serve as a fundamental linguistic resource for natural language processing research. We design either terminology or ontology for structuring the lexicon based on the idea of computability and reusability.
  • Canasai Kruengkrai, Thatsanee Charoenporn, Virach Sornlertlamvanich, 井佐原 均
    International Conference On Natural Language Processing (ICON 2004) 2004年12月  査読有り
    辞書の開発のため、テキストから単語の持つ意味情報を自動的に獲得する手法を開発した。
  • Thatsanee Charoenporn, Canasai Kruengkrai, Virach Sornlertlamvanich, Thanaruk Theeramunkong, 井佐原 均
    Third International Workshop on DICTIONARY WRITING SYSTEMS (DWS 2004) 2004年9月  査読有り
    大量の電子化テキストに基づく辞書開発のための枠組みを提案する。
  • Thatsanee Charoenporn, Pornchai Tummarattananont, Virach Sornlertlamvanich, 井佐原 均
    International Conference on eLearning for Knowledge-Based Society 2004年5月  査読有り
    新しい教育の実現のためのE-learning について述べた。
  • Canasai Kruengkrai, Thatsanee Charoenporn, Virach Sornlertlamvanich, 井佐原均
    IJCNLP-04 668-675 2004年3月  査読有り
    大量の電子化テキストに基づく辞書開発のための枠組みを提案する。大規模データからの語義の自動獲得を効率的に行う手法を開発した。
  • Thatsanee Charoenporn, Canasai Kruengkrai, Virach Sornlertlamvanich, 井佐原均
    Fourth Workshop on Asia Language Resources 47-53 2004年3月  査読有り
    大量の電子化テキストに基づく辞書開発のための枠組みを提案する。
  • Thatsanee Charoenporn, Virach Sornlertlamvanich, Sawit Kasuriya, Chatchawarn Hansakunbuntheung, Hitoshi Isahara
    Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004 1193-1196 2004年  査読有り
    Language Resources are recognized as an essential component in linguistic infrastructure and a starting point of Natural Language Processing systems and applications. In this paper, we describe the achievement of the development and the use of Thai Language Resources germinated with an open collaboration platform, under the collaboration between research institutes. The resources include either text or speech. Text resources are divided into lexicon database and annotated corpus. We started developing a corpus-based Thai-English lexicon database (LEXiTRON) since 1994. It was originated from a dictionary designed for using in developing a machine translation system. Since then the Thai POS was designed and evaluated in several applications (word segmentation, machine translation, grapheme-to-phoneme, etc.) Extending the lexicon database, POS tagged corpus (ORCHID), and speech corpora for both synthesis and recognition are developed and functioned as an important part of research and development on NLP or HLT. These language resources are available for academic experiment.
  • Canasai Kruengkrai, Thatsanee Charoenporn, Virach Sornlertlamvanich, Hitoshi Isahara
    Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004 2201-2204 2004年  査読有り
    A statistical corpus-based approach for acquiring selectional preferences of verbs is proposed. By parsing through text corpora, we obtain examples of context nouns that are considered to be the selectional preferences of a given verb. The approach is to generalize initial noun classes to the most appropriate levels on a semantic hierarchy. We present an iterative algorithm for generalization by combining an agglomerative merging and a model selection technique called the Bayesian Information Criterion (BIC). In our experiments, we consider the Web as the large corpora. We also propose approaches for extracting examples from the Web. Preliminarily experimental results are given to show the feasibility and effectiveness of our approach.
  • Thatsanee Charoenporn, Virach Sornlertlamvanich, Sawit Kasuriya, Chatchawarn hansakunbuntheung, 井佐原 均
    ELSNET/ENABLER Workshop 2003年8月  査読有り
    タイ語言語資源の強調的開発の現状について報告する。

MISC

 1

講演・口頭発表等

 21

担当経験のある科目(授業)

 12