×

You are using an outdated browser Internet Explorer. It does not support some functions of the site.

Recommend that you install one of the following browsers: Firefox, Opera or Chrome.

Contacts:

+7 961 270-60-01
ivdon3@bk.ru

A method for automated formation of a training data set for machine learning algorithms for classification of electronic documents

Abstract

A method for automated formation of a training data set for machine learning algorithms for classification of electronic documents

Korolev I.D., Akinfiev D.V.

Incoming article date: 24.08.2023

The article considers a method of automated formation of a training data set for machine learning algorithms for classification of electronic documents, which differs from the known ones by forming training data sets based on the synthesis of clustering and data augmentation methods based on calculating the distance between objects in multidimensional spaces.

Keywords: teaching with a teacher, clustering, pattern recognition, machine learning algorithm, electronic document, vectorization, formalized documents