International Journal Of Eurasia Social Sciences

DOĞAL DİLLERDEKİ İŞARETLEME SÜRECİNE BİLGİ KURAMI AÇISINDAN BAKIŞ (TÜRKÇE ÖRNEĞİNDE)

AN INSIGHT INTO THE CODING PROCESS OF MORPHEMES IN THE NATURAL LANGUAGES IN TERMS OF THE INFORMATION THEORY (IN THE EXAMPLE OF TURKISH)

DOĞAL DİLLERDEKİ İŞARETLEME SÜRECİNE BİLGİ KURAMI AÇISINDAN BAKIŞ (TÜRKÇE ÖRNEĞİNDE)
Author	: Çağlayan YILMAZ -
Type	:
Printing Year	:
Number	:
Page	:
DOI Number:	:
Cite :	Çağlayan YILMAZ -, (). DOĞAL DİLLERDEKİ İŞARETLEME SÜRECİNE BİLGİ KURAMI AÇISINDAN BAKIŞ (TÜRKÇE ÖRNEĞİNDE). International Journal Of Eurasia Social Sciences, , p. . Doi: .

Bu eserin tam metni dosya olarak bulunmamaktadır

Summary
Bu çalışma, anlam birimlerinin işaretlenme sürecinde geçirdiği dönüşüm üzerine odaklanmıştır. Bu dönüşüm kullanım kaynaklıdır. Boğumlanma özelliklerine göre eklemlenebilir parçalarına ayrılarak işaretlenen anlam birimleri, iletilerdeki kullanım sıklığına bağlı olarak gelişimini sürdürür. Çalışmada, Türkçedeki anlam birimlerinin bilgi değerlerine göre işaretlenmesi durumunda kullanılacak ölçüm için bir yöntem önerilmiştir. Bu yöntem, C. E. Shannon tarafından geliştirilen ve “anlam”ı ölçülebilir bir kavram olarak tanımlayan bilgi kuramı temellidir. Bu yöntem, iletilerdeki sembollerin bilgi değerini, onların iletilerdeki kullanım sıklığıyla ilişkilendirir. Bu sebeple, öncelikle bu ilişki üzerinde durulmuş, basit birkaç örnekle iletilerdeki sembollerin kullanım sıklıklarıyla onların işaret sayıları arasındaki ilişki ortaya konmuştur. Bu ilişki, verilen birkaç örnekle Türkçe sözcükler üzerinden somutlaştırılmıştır. Kullanım sıklıkları üzerinden birkaç Türkçe sözcüğün bilgi değeri ölçülmüş ve bu değerler üzerinden her bir anlam biriminin işaretlenmesi için gereken işaret sayısı belirlenmiştir. Ancak bunun için bir derlemin oluşturulması ve bu derlemdeki her bir metnin anlam birimlerine ayrıştırılması ve her bir anlam biriminin derlemdeki kullanım sıklığının belirlenmesi gerekmiştir. Bu aşamada, yine bu çalışmanın sahibi tarafından daha önce yapılmış bir çalışma için hazırlanmış olan, özellikle son 10 yılda yayımlanmış 100 adet metin parçasından oluşan derlemdeki metinlerin çözümlenmesiyle oluşturulmuş veri kullanılmıştır. Ayrıca yine söz konusu derlem kullanılarak, kullanım sıklıklarından hareketle, Türkçe anlam birimlerinin ortalama bilgi değeri (entropisi) hesaplanmış ve bu değer üzerinden, alfabedeki harf sayısıyla sözcükleri işaretlemek için gerekli olan işaret sayısı arasındaki ilişki (daha açık bir ifadeyle, bir ters orantı) ortaya konmuştur. Son olarak çalışmanın derleminden kaynaklı yöntem tercihi, tartışmaya sunulmuştur. Olması gereken ölçme yöntemi ile mümkün olan ölçme yöntemi arasındaki tercih, bilgi kuramının kendisinin sorgulanmasını gerektirecek boyuttadır. Alfabe ve yazı arasındaki ilişkiyi bilgi kuramıyla ilişkilendirerek ortaya koyan bu çalışma, sonuçlarından çok yöntemi bakımından tartışılmalıdır. Anlam birimlerinin ses değerine göre değil, bilgi değerine göre işaretlenmesi oldukça verimli kuramsal çalışmaların önünü açacaktır.

Keywords
bilgi kuramı, bilgi değeri, entropi, işaret, işaretleme, yazı

Abstract
This study focuses on the transformation that morphemes undergo in the coding process. This transformation is related to the use. The morphemes which are coded by being separated into the parts which can be articulated according to the characteristics of the articulation. In the study, a method, which will be used in the cases that the Turkish morphemes are codedaccording to the information of value of Turkish morphemes. This method is based on the information theory which is developed by C.E. Shannon and which describes “meaning” as a measurable concept. This method relates the information value of symbols in the messages to their frequency of use in the messages. Therefore, it was focused on this relation first and the relation between the relation between the frequency of use and their code numbers by giving several basic examples. This relation is objectified through Turkish words with several examples. The information of value of the several Turkish words is measured depending on their frequency of use and the code number whichis necessary for the coding of each morpheme, depending on these values. However, it is necessary that a corpus should be formed and the separation of each text into morphemes and that the frequency of use of each morpheme should be determined of each morphemes within the corpus. At that stage, the data formed by the analysis of the text pieces and composed of 100 text pieces published specifically in the last decade and which have been prepared by the author of this study for a previous study. In addition, the average enthropy of the Turkish morphemes has been calculated by considering their frequency of use by the mentioned corpus and their frequency of use and, based on this value, the relation between the number of words by the letter number in the alphabet and the code number necessary to code the words (in clearer terms, inverse proportion) are presented. In the end, the methodpreference based on the corpus of the study is presented for discussion. The preference between the dd method in the necessary measurement and the possible measurement method is at a dimension which will require the questioning of the information theory itself.This study,which relates the relation the alphabet and writing, should be discussed in terms of its method rather than its results. The coding of morphemesnot according to sound value of morphemes but according to their information value will lead way to efficient theoretical studies.

Keywords
informational theory, informational value, entropy, code, coding, writing