Souichi Oka, Yoshiyasu Takefuji
European Journal of Radiology 191, 112308. October 2025. Peer-reviewed.
This correspondence critically examines the methodology of Schindele et al. (2025) on thyroid cancer recurrence prediction. Although their interpretable XGBoost model achieved high predictive accuracy (95.8%) and an AUROC of 0.947, this predictive power does not by itself establish the reliability of the derived feature importance rankings. As widely acknowledged in the literature, high predictive accuracy does not guarantee unbiased or reliable feature attribution. We underscore that gradient boosting decision tree (GBDT) models, including XGBoost, are prone to inherent biases in feature importance estimation, often exacerbated by overfitting. Furthermore, SHapley Additive exPlanations (SHAP), a widely adopted explainable AI (XAI) technique, can inherit and even amplify these biases because of its model-dependent nature, raising concerns about the interpretive validity of the identified risk factors. To mitigate these methodological limitations, we advocate integrative analytical frameworks that combine machine learning with robust statistical and non-parametric approaches, such as Highly Variable Feature Selection (HVFS) and Independent Component Analysis (ICA). Such multi-faceted strategies are indispensable for obtaining robust, interpretable insights into feature importance and should be prioritized in future research.
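The core claim—that a GBDT's built-in feature importance can credit uninformative features while out-of-sample checks do not—can be illustrated with a minimal sketch. This is not the analysis from the correspondence or from Schindele et al.; it uses scikit-learn's GradientBoostingClassifier as a stand-in GBDT (rather than XGBoost) on synthetic data, with one informative feature and one pure-noise feature:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n = 400
informative = rng.normal(size=n)          # genuinely predictive feature
noise = rng.normal(size=n)                # unrelated to the label
y = (informative + 0.5 * rng.normal(size=n) > 0).astype(int)
X = np.column_stack([informative, noise])

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Deliberately deep trees and many rounds to encourage overfitting,
# the failure mode the correspondence highlights for GBDT importances.
model = GradientBoostingClassifier(max_depth=8, n_estimators=200, random_state=0)
model.fit(X_tr, y_tr)

# Impurity-based importance is computed from training-set splits, so the
# noise feature can still receive nonzero credit.
print("impurity-based importances:", model.feature_importances_)

# Permutation importance on held-out data is an out-of-sample check:
# shuffling the noise feature barely hurts, shuffling the informative one does.
perm = permutation_importance(model, X_te, y_te, n_repeats=20, random_state=0)
print("permutation importances:", perm.importances_mean)
```

Under these (illustrative) settings, the overfit model assigns the noise feature a nonzero impurity-based importance, while held-out permutation importance concentrates on the informative feature—consistent with the argument that training-set-derived attributions should be corroborated by independent checks.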