Method of text classification without the use of training
https://doi.org/10.17586/0021-3454-2026-69-1-90-94
Abstract
A new approach to text classification is proposed that does not employ machine learning methods or require a training set. The method is based on the Damerau-Levenshtein distance, which is the minimum number of editing operations required to transform one string into another and takes into account the semantic similarity of words, weighting of editing operations, and the order of importance of words. The main metrics for assessing the quality of a text classifier and the results of testing the proposed method against these metrics are presented.
About the Authors
T. M. TatarnikovaRussian Federation
Tatyana M. Tatarnikova — Dr. Sci., Professor; Institute of Information Technologies and Programming; Director of the Institute
St. Petersburg
D. R. Milyaev
Russian Federation
Dmitry R. Milyaev — Post-Graduate Student; Department of Information Systems
St. Petersburg
References
1. Dudikhin V.V., Kondrashov P.E. E-Journal Public Administration, 2024, no. 105, pp. 169–179, DOI: 10.55959/MSU2070-1381-105-2024-169-179. (in Russ.)
2. Houlsby N., Giurgiu A., Jastrzebski S., Morrone B. et al. Proc. 36th Int. Conf. on Machine Learning, 2019, vol. 97, pp. 2790–2799.
3. Kuznetsov A.V. New Information Technologies in Education and Science, 2022, no. 5, pp. 53–57, DOI: 10.17853/2587-6910-2022-05-53-57. (in Russ.)
4. Sovetov B.Ya., Tatarnikova T.M., Yashin A.I. Proceedings of Saint Petersburg Electrotechnical University, 2019, no. 4, pp. 26–32. (in Russ.)
5. Batura T.V. Software & Systems, 2017, no. 1(30), pp. 85–99, DOI: 10.15827/0236-235X.030.1.085-099. (in Russ.)
6. Belov S., Zrelova D., Zrelov P., Korenkov V. System Analysis in Science and Education, 2020, no. 3, pp. 8–22, URL: http://sanse.ru/download/401. (in Russ.)
7. Tatarnikova T.M., Mokretsov N.S. Software & Systems, 2025, no. 2, pp. 361–365, DOI: 10.15827/0236-235X.150.361 365. (in Russ.)
8. Maksyutin P.A., Shuljenko S.N. Ingineering Journal of Don, 2022, no. 12, URL: ivdon.ru/ru/magazine/archive/n12y2022/8043. (in Russ.)
9. Khurana A., Subramonyam H., Chilana P.K. Proc. of the 29th Intern. Conf. on Intelligent User Interfaces, 2024, рр. 288–303, https://doi.org/10.1145/3640543.3645200.
10. Tarasov D.V., Romanov N.A. University Proceedings. Volga Region. Technical Sciences, 2017, no. 1(41), pp. 56–72, DOI: 10.21685/2072-3059–2017-1-5.
11. Lane H., Hapke H., Howard C. Natural Language Processing in Action, Manning Publications Co., 2019, 544 p.
Review
For citations:
Tatarnikova T.M., Milyaev D.R. Method of text classification without the use of training. Journal of Instrument Engineering. 2026;69(1):90-94. (In Russ.) https://doi.org/10.17586/0021-3454-2026-69-1-90-94
JATS XML














