Preview

Journal of Instrument Engineering

Advanced search
Open Access Open Access  Restricted Access Subscription Access

Method of text classification without the use of training

https://doi.org/10.17586/0021-3454-2026-69-1-90-94

Abstract

A new approach to text classification is proposed that does not employ machine learning methods or require a training set. The method is based on the Damerau-Levenshtein distance, which is the minimum number of editing operations required to transform one string into another and takes into account the semantic similarity of words, weighting of editing operations, and the order of importance of words. The main metrics for assessing the quality of a text classifier and the results of testing the proposed method against these metrics are presented.

About the Authors

T. M. Tatarnikova
St. Peterdburg State University of Aerospace Instrumentation
Russian Federation

Tatyana M. Tatarnikova — Dr. Sci., Professor; Institute of Information Technologies and Programming; Director of the Institute

St. Petersburg



D. R. Milyaev
St. Petersburg Electrotechnical University „LETI“
Russian Federation

Dmitry R. Milyaev — Post-Graduate Student; Department of Information Systems

St. Petersburg



References

1. Dudikhin V.V., Kondrashov P.E. E-Journal Public Administration, 2024, no. 105, pp. 169–179, DOI: 10.55959/MSU2070-1381-105-2024-169-179. (in Russ.)

2. Houlsby N., Giurgiu A., Jastrzebski S., Morrone B. et al. Proc. 36th Int. Conf. on Machine Learning, 2019, vol. 97, pp. 2790–2799.

3. Kuznetsov A.V. New Information Technologies in Education and Science, 2022, no. 5, pp. 53–57, DOI: 10.17853/2587-6910-2022-05-53-57. (in Russ.)

4. Sovetov B.Ya., Tatarnikova T.M., Yashin A.I. Proceedings of Saint Petersburg Electrotechnical University, 2019, no. 4, pp. 26–32. (in Russ.)

5. Batura T.V. Software & Systems, 2017, no. 1(30), pp. 85–99, DOI: 10.15827/0236-235X.030.1.085-099. (in Russ.)

6. Belov S., Zrelova D., Zrelov P., Korenkov V. System Analysis in Science and Education, 2020, no. 3, pp. 8–22, URL: http://sanse.ru/download/401. (in Russ.)

7. Tatarnikova T.M., Mokretsov N.S. Software & Systems, 2025, no. 2, pp. 361–365, DOI: 10.15827/0236-235X.150.361 365. (in Russ.)

8. Maksyutin P.A., Shuljenko S.N. Ingineering Journal of Don, 2022, no. 12, URL: ivdon.ru/ru/magazine/archive/n12y2022/8043. (in Russ.)

9. Khurana A., Subramonyam H., Chilana P.K. Proc. of the 29th Intern. Conf. on Intelligent User Interfaces, 2024, рр. 288–303, https://doi.org/10.1145/3640543.3645200.

10. Tarasov D.V., Romanov N.A. University Proceedings. Volga Region. Technical Sciences, 2017, no. 1(41), pp. 56–72, DOI: 10.21685/2072-3059–2017-1-5.

11. Lane H., Hapke H., Howard C. Natural Language Processing in Action, Manning Publications Co., 2019, 544 p.


Review

For citations:


Tatarnikova T.M., Milyaev D.R. Method of text classification without the use of training. Journal of Instrument Engineering. 2026;69(1):90-94. (In Russ.) https://doi.org/10.17586/0021-3454-2026-69-1-90-94

Views: 173

JATS XML

ISSN 0021-3454 (Print)
ISSN 2500-0381 (Online)