Yoshiyasu Takefuji
Journal of Hazardous Materials, 488, May 5, 2025 Peer-reviewed
This paper outlines key machine learning principles, focusing on XGBoost and SHAP values, to help researchers avoid common analytical pitfalls. XGBoost builds a model by incrementally adding decision trees, each correcting the errors of its predecessors; this emphasis on misclassified examples can inflate feature importance scores. SHAP values offer a theoretically grounded way to interpret predictions, but their dependence on model structure and feature interactions can introduce biases of their own. The absence of ground truth complicates model evaluation, because biased feature importance can obscure genuine relationships with the target variable. Ground truth values, the actual labels used for training and validation, serve as benchmarks for comparing model outputs with true outcomes and are essential for improving predictive accuracy. They do not, however, guarantee real associations between features and targets; they only gauge how accurately the model predicts. The paper therefore urges researchers to recognize these biases in feature importance and model evaluation, and advocates rigorous statistical methods to improve the reliability of machine learning analyses.
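The gap between model-derived importance and model-independent statistical association can be sketched as follows. This is a minimal illustration on synthetic data, not the paper's method: it uses scikit-learn's gradient boosting and permutation importance as stand-ins for XGBoost and SHAP, and the features `x1` (truly predictive) and `x2` (pure noise) are hypothetical.

```python
import numpy as np
from scipy.stats import spearmanr
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.inspection import permutation_importance

rng = np.random.default_rng(0)
n = 500
x1 = rng.normal(size=n)                       # genuinely associated with the target
x2 = rng.normal(size=n)                       # pure noise, no real association
X = np.column_stack([x1, x2])
y = 2.0 * x1 + rng.normal(scale=0.5, size=n)  # target depends on x1 only

# Boosted trees fit incrementally, each tree correcting the previous ones' errors
model = GradientBoostingRegressor(random_state=0).fit(X, y)

# Model-derived importance: depends on how the trees were built
model_imp = model.feature_importances_

# Model-agnostic check: permutation importance measures accuracy loss
# when each feature is shuffled
perm = permutation_importance(model, X, y, n_repeats=10, random_state=0)

# Model-independent statistical association: nonparametric rank correlation
rho1, p1 = spearmanr(x1, y)
rho2, p2 = spearmanr(x2, y)
```

Comparing `model_imp` with the Spearman coefficients (and their p-values) is one way to cross-check that a highly ranked feature reflects a real association with the target rather than an artifact of the boosting procedure.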