An Improved Firefly Algorithm for Mining High Utility Itemsets
Received: 28 April 2025 | Revised: 14 June 2025 and 19 June 2025 | Accepted: 21 June 2025 | Online: 6 October 2025
Corresponding author: Keerthi Mohan
Abstract
High-Utility Itemset Mining (HUIM) is a pivotal subfield of data mining that focuses on identifying itemsets with high utility rather than merely frequent patterns. Traditional HUIM algorithms are hindered by the exponential complexity of their search space, rendering them inefficient for large-scale databases. Evolutionary Computation (EC)-based approaches have emerged as promising alternatives, offering approximate, yet effective solutions. However, the existing EC-based HUIM methods still encounter considerable computational overhead when extracting true High-Utility Itemsets (HUIs). To address this constraint, this paper proposes a Firefly Algorithm (FA) and a modified FA for HUIM, incorporating a novel flexible inertia weight predicated on logarithmic decrement. This innovation leads to faster convergence and increased mining efficiency. The experimental evaluations on four publicly available benchmark datasets demonstrate that the proposed approach significantly outperforms state-of-the-art algorithms, including HUIM-Genetic Algorithm (GA), Hybrid Ant Colony Optimization (ACO)/GA, and HUIM-FA in terms of utility discovery, execution time, and scalability. The results indicate that the method is highly scalable and maintains robust performance across increasing transaction volumes and item dimensions. It achieves superior runtime, memory efficiency, and a high number of HUIs while sustaining a strong convergence rate, thus establishing its practicality for real-world, large-scale HUIM applications.
Keywords:
high utility itemset mining, evolutionary computation, firefly algorithm, itemsetsDownloads
References
R. Agrawal and R. Srikant, "Fast algorithms for mining association rules in large databases," in VLDB ’94: Proceedings of the 20th International Conference on Very Large Data Bases, San Francisco, CA, United States, Sep. 1994, pp. 487–499.
Y. Liu, W. Liao, and A. Choudhary, "A Two-Phase Algorithm for Fast Discovery of High Utility Itemsets," in Lecture Notes in Computer Science, Berlin, Heidelberg, 2005, pp. 689–695.
V. S. Tseng, B.-E. Shie, C.-W. Wu, and P. S. Yu, "Efficient Algorithms for Mining High Utility Itemsets from Transactional Databases," IEEE Transactions on Knowledge and Data Engineering, vol. 25, no. 8, pp. 1772–1786, Aug. 2013.
S. Kannimuthu and K. Premalatha, "Discovery of High Utility Itemsets Using Genetic Algorithm with Ranked Mutation," Applied Artificial Intelligence, vol. 28, no. 4, pp. 337–359, Apr. 2014.
S. Zida, P. Fournier-Viger, J. C.-W. Lin, C.-W. Wu, and V. S. Tseng, "EFIM: A Highly Efficient Algorithm for High-Utility Itemset Mining," in Lecture Notes in Computer Science, Cham: Springer International Publishing, 2015, pp. 530–546.
A. Y. Peng, Y. S. Koh, and P. Riddle, "mHUIMiner: A Fast High Utility Itemset Mining Algorithm for Sparse Datasets," in Lecture Notes in Computer Science, Cham: Springer International Publishing, 2017, pp. 196–207.
J. C.-W. Lin, L. Yang, P. Fournier-Viger, T.-P. Hong, and M. Voznak, "A binary PSO approach to mine high-utility itemsets," Soft Computing, vol. 21, no. 17, pp. 5103–5121, Sep. 2017, https://doi.org/10.1007/s00500-016-2106-1.
W. Song and J. Li, "Discovering High Utility Itemsets Using Set-Based Particle Swarm Optimization," in Lecture Notes in Computer Science, Cham: Springer International Publishing, 2020, pp. 38–53.
J. M.-T. Wu, J. Zhan, and J. C.-W. Lin, "An ACO-based approach to mine high-utility itemsets," Knowledge-Based Systems, vol. 116, pp. 102–113, Jan. 2017.
K. Mohan and J. Anitha, "Mining High Utility Itemset with Hybrid Ant Colony Optimization Algorithm," International Journal of Advanced Computer Science and Applications, vol. 15, no. 12, 2024.
M. S. Nawaz, P. Fournier-Viger, U. Yun, Y. Wu, and W. Song, "Mining High Utility Itemsets with Hill Climbing and Simulated Annealing," ACM Transactions on Management Information Systems, vol. 13, no. 1, pp. 1–22, Mar. 2022.
M. Ledmi, A. Ledmi, M. E. H. Souidi, A. Hamdi-Cherif, T. M. Maarouk, and C. K.-M. Hamdi-Cherif, "High-utility itemsets mining integrating an improved crow search algorithm and particle search optimization," Soft Computing, vol. 28, no. 13–14, pp. 8471–8496, Jul. 2024.
S. R. Meruva and B. Venkateswarlu, "Dynamic Association Mining Techniques for the Faster Extraction of High Utility Itemsets from Incremental Databases," Engineering, Technology & Applied Science Research, vol. 15, no. 1, pp. 19396–19400, Feb. 2025.
Y. Li, Y. Zhao, Y. Shang, and J. Liu, "An improved firefly algorithm with dynamic self-adaptive adjustment," PLOS ONE, vol. 16, no. 10, Oct. 2021, Art. no. e0255951.
X.-S. Yang, Nature-inspired metaheuristic algorithms, 2nd ed. Frome: Luniver Press, 2010.
R. J. Bayardo, "Efficiently mining long patterns from databases," ACM SIGMOD Record, vol. 27, no. 2, pp. 85–93, Jun. 1998.
"Mushroom," UCI Machine Learning Repository, 1981. [Online]. Available: https://doi.org/10.24432/C5959T.
D. Chen. "Online Retail II," UCI Machine Learning Repository, 2012. [Online]. Available: https://doi.org/10.24432/C5CG6D.
M. Bain and A. Hoff. "Chess (King-Rook vs. King)," UCI Machine Learning Repository, 1994. [Online]. Available: https://doi.org/10.24432/C57W2S.
J. Tromp. "Connect-4," UCI Machine Learning Repository, 1995. [Online]. Available: https://doi.org/10.24432/C59P43.
P. Fournier-Viger et al., "The SPMF Open-Source Data Mining Library Version 2," in Lecture Notes in Computer Science, Cham: Springer International Publishing, 2016, pp. 36–40.
Downloads
How to Cite
License
Copyright (c) 2025 Keerthi Mohan, J. Anitha

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.