Sobiad Atıf Dizini

İndirme 3

Makale Detay

Benzer Makaleler

Dergi Bilgisi

Eseri Dinleyin

Alıntı Yap

Bu Sayfayı Yazdırın

Paylaş

Development of the quantitative method for automated text content authorship attribution based on the statistical analysis of N-grams distribution

2019

Dergi:

Eastern-European Journal of Enterprise Technologies

Yazar:

DOI:

10.15587/1729-4061.2019.186834

Özet:

The peculiarities of the application of linguo-statistics technologies for the identification of the style of the author of text content of scientific and technical profile are considered. Quantitative linguistic analysis of a text uses the benefits of content monitoring based on the NLP methods to identify and analyze the set of stop words, keywords, set phrases and to study N-gram. The latter are used in the linguometry methods to determine in per cent if the given text belongs to a particular author. The quantitative method for automatic text content authorship attribution was developed based on statistical analysis of the 3-gram distribution. The approach to the implementation of identification of the author of the text in the Ukrainian language of the scientific and technical profile was proposed. Experimental results of the proposed method to determine the belonging of the analyzed text to a specific author in the presence of the reference text were obtained. Application of the linguo-statistical analysis of the 3-grams to a set of articles will make it possible to form a subset of publications that are similar in linguistic descriptions. Imposing additional conditions in the form of statistical and quantitative analyses (a set of keywords, set expressions, stylometric, linguometric analyses, etc.) on a subset will allow a significant reduction of this subset by specifying the list of the most likely author. For qualitative and effective content analysis when determining the degree of authorship of a particular author, we propose to analyze the reference text and the one under consideration at several stages: linguometric analysis of the coefficients of the diversity of the author's speech, stylometric analysis, analysis of set expressions, linguo-statistical analysis of 3-grams. For automated text processing, not only the frequency of occurrence of a certain category, but also its existence in the studied text in general are important. Quantitative computation makes it possible to draw objective conclusions about the orientation of materials by the number of using the units of analysis in the studied texts. Qualitative analysis does the same, but as a result of the study of whether (and in what context) there is a certain important original category in general Author Biographies Vasyl Lytvyn, Lviv Polytechnic National University S. Bandery str., 12, Lvіv, Ukraine, 79013 Doctor of Technical Sciences, Professor Department of Information Systems and Networks

Anahtar Kelimeler:

Atıf Yapanlar

Bilgi: Bu yayına herhangi bir atıf yapılmamıştır.

Benzer Makaleler

1. Analysis of statistical methods for stable combinations determination of keywords identification

2018

Eastern-European Journal of Enterprise Technologies

2. Analysis of the developed quantitative method for automatic attribution of scientific and technical text content written in Ukrainian

2018

Eastern-European Journal of Enterprise Technologies

3. Development of the linguometric method for automatic identification of the author of text content based on statistical analysis of language diversity coefficients

2018

Eastern-European Journal of Enterprise Technologies

4. Method of determining keywords for English texts based on DKPro Core

2015

Technology Audit and Production Reserves

5. Development of a method for determining the keywords in the slavic language texts based on the technology of web mining

2017

Eastern-European Journal of Enterprise Technologies

6. Development of the method for filtering verbal noise while search keywords for the English text

2018

Technology Audit and Production Reserves

Eastern-European Journal of Enterprise Technologies

Alan : Fen Bilimleri ve Matematik

Dergi Türü : Uluslararası

Metrikler

Makale : 4.764

Atıf : 4.485

2023 Impact/Etki : 0.294

Detaylı İncele

Eastern-European Journal of Enterprise Technologies

Özet
Eseri Dinleyin

Yazar : --

Dergi :

Sayı

Yıl

Tür

Atıf Sayısı

PDF Görüntüle

Benzer Makaleler
Bu Yayına Atıf Yapanlar

Benzer Makaleler	Yazar	#

Makale	Yazar	#

Kullanım Kılavuzu

Menü

Mendeley

Endnote

Development of the quantitative method for automated text content authorship attribution based on the statistical analysis of N-grams distribution

2019

Dergi:

Eastern-European Journal of Enterprise Technologies

Yazar:

DOI:

10.15587/1729-4061.2019.186834

Özet:

Anahtar Kelimeler:

Atıf Yapanlar

Bilgi: Bu yayına herhangi bir atıf yapılmamıştır.

Benzer Makaleler

Eastern-European Journal of Enterprise Technologies

Metrikler