증권 기사의 자질 기반 감성 분석을 위한 주석 데이터 구축 연구 :

윤정우

추천

검색

자료유형: 학위논문

저자정보: 윤정우 (한국외국어대학교, 한국외국어대학교 대학원)

지도교수: 남지순

발행연도: 2023

저작권: 한국외국어대학교 논문은 저작권에 의해 보호받습니다.

이용수2

이 논문의 연구 히스토리 (2)

2023

증권 기사의 자질 기반 감성 분석을 위한 주석 데이터 구축 연구

윤정우 2023.01 학위논문

2021

자질기반 감성분석 모델 학습을 위한 주석데이터셋 FeSAD 구축 방법론

윤정우 , 남지순 언어연구 2021.11 학술저널

이 논문의 후속연구가 궁금하신가요?
연관 학술논문 또는 학술발표를 통해 보다 발전된 연구결과를 확인하실 수 있습니다.
이 논문의 연구 히스토리 확인하기

초록· 키워드

오류제보하기

This study proposes an annotated dataset for Feature-based Sentiment Analysis (FbSA) in the financial domain. The dataset is constructed based on a two-step annotation process, which consists of the SSP (Semi-automated Symbolic Propagation) methodology and supplementary annotation by a human annotator.
Financial domain research primarily relies on numerical data, but text data also provides crucial information that is difficult to quantify. Corporate articles offer various incidents about companies, and FbSA aims to categorize events related to companies and extract both positive and negative news from corporate articles.
Unlike sentiment analysis at the sentence level, FbSA requires advanced language knowledge to understand the semantic information for each element in a sentence. It also involves extracting relational information between vocabularies. Moreover, typical sentiment expressions are not commonly used in financial news data. Instead, various combinations of numerical nouns, predicates indicating ups and downs, and event nouns appear in different aspects, and the polarity also changes depending on the combination of these elements. Therefore, building high-quality annotated data for machine learning is a laborious task.
To achieve effective semantic information annotation, linguistic resources linked to the DECO Korean dictionary and LGG (Local Grammar Graph) were utilized. These linguistic resources consist of categorized patterns for entities, features, and sentiment expressions. They can be applied to the raw text to annotate single-word and multi-word expressions, as well as adjacent feature-sentiment expression pairs.
The linguistic resources constructed in this study reflect the aforementioned characteristics. The resources were augmented through a bootstrap method, which involved a cycle of semi-automatic annotation on the data and the addition of new expressions acquired during the annotator''s annotation process to the resources. It was confirmed that the final linguistic resources showed an f1-score of 0.90, demonstrating their contribution to increasing the efficiency of the actual annotation process.
The annotated data for the financial domain FbSA proposed in this study can be effectively extended to reflect changes in the stock market and the latest company and product information by utilizing structured linguistic resources. Furthermore, it is expected that the annotated data will be used for sophisticated stock price prediction in conjunction with numerical information.

제1장 서론 1
1.1. 연구의 목적과 필요성 1
1.2. 논문의 구성 7
제2장 감성 분석의 이론적 배경과 국내외 선행 연구 8
2.1. 감성 분석의 이론적 배경 8
2.1.1. 감성 표현의 양상 8
2.1.2. 자질 기반 감성 분석과 자질의 양상 10
2.2. 국내외 선행 연구 13
2.2.1. 감성 사전 및 감성 어휘 기반 연구 13
2.2.2. 규칙 기반 연구 15
2.2.3. 기계학습 기반 연구 16
제3장 연구 데이터와 연구 방법론 19
3.1. 연구 데이터 수집 19
3.2. 연구 데이터 분석 22
3.2.1. 개체명 23
3.2.2. 서술성 명사 24
3.2.3. 삽입 표현 26
3.2.4. 문맥 의존 감성 표현 28
3.2.5. 사건 유형 29
3.3. 연구 방법론 33
제4장 언어 자원 구축 35
4.1. SSP 방법론 35
4.2. DECO-LGG 언어 자원의 구조 38
4.3. 자질 기반 감성 분석을 위한 분류 체계 44
4.3.1. 개체명 분류 44
4.3.2. 자질 분류 46
4.3.3. 감성 극성 분류 50
4.4. 사전 자원 53
4.4.1. 개체명 53
4.4.2. 자질명 55
4.4.3. 감성 표현 58
4.5. 그래프 자원 63
4.5.1. 개체명 및 자질명 63
4.5.2. 감성 표현 65
4.6. 언어 자원의 증강과 작업자의 주석 보강 프로세스 72
제5장 자질 기반 감성 분석 3원소의 페어링과 최종 데이터 구성 74
5.1. 자질 기반 감성 분석 3원소의 페어링 74
5.2. 최종 주석 데이터 구성 79
5.2.1. 전체 데이터 통계 79
5.2.2. 자질별 통계 80
제6장 언어 자원의 성능 평가 82
제7장 결론 88
참고문헌 91
ABSTRACT 97

최근 본 자료

전체보기

구분	그룹	데이터 항목
AI 학습용 데이터	원문	원문 PDF 파일
AI 학습용 데이터	원문 + 메타 (기본/상세)	원문 PDF 파일 및 서지정보 CSV
대량 구매용 데이터	B2B 구독 방식	특정 자료 한정으로 원문 접근 권한 부여
대량 구매용 데이터	URL 전달 방식	바로 PDF 뷰어를 열람할 수 있는 URL 제공

구분	그룹	데이터 항목
AI 학습용 데이터	기본 메타	발행기관명, 간행물명, 권호명, 권(vol), 호(issue), 통권, 발행연도, 발행월, 논문명, 저자명, 시작페이지, 종료페이지, 전체페이지, 상세페이지URL
상세 메타 데이터	발행기관 메타	발행기관 이명, 영문명, 창립연도, 홈페이지URL, 발행기관 소개
	간행물 메타	부제목, 간행물 유형, ISSN, ISBN, 최초발행연도, 폐간연도, 간행빈도, 발행주기, 등재사항, 이용수, 피인용수, 권호수, 논문수, 표지이미지
	논문 메타	작성 언어, 부제목, 대등제목, 목차, 키워드, 초록, 이미지, 참고문헌, 이용수, 피인용수, 논문활용도, DBpia통합주제분류, KDC분류, DDC분류, 한국연구재단분류, UCI, DOI
	저자 메타	소속기관, 소속부서, 직급, 연구분야, 연구키워드, 이용수, 피인용수, 저자 논문활용도

구분	그룹	데이터 항목
※ 결합형/맞춤형 메타 데이터는 신청 내용에 따라 다양하게 제공 가능
이용순위 정보	주제분야별 많이 이용된 논문	“인문학”에서 많이 이용된 논문 TOP100
	이용기관별 많이 이용된 논문	“중고등학교”에서 많이 이용된 논문 TOP100
	세부기관별 많이 이용된 논문	“서울대학교”에서 많이 이용된 논문 TOP100
	키워드별 많이 이용된 논문	“Chat GPT”에서 많이 이용된 논문 TOP100
키워드 정보	많이 이용된 키워드	특정기간/분야/저널 내 많이 이용된 키워드
	많이 발행된 키워드	특정기간/분야/저널 내 많이 발행된 키워드
	많이 검색된 키워드	특정기간/분야/저널 내 많이 검색된 키워드
	연구 트렌드 키워드	특정 키워드 연관 연구동향 분석 데이터 키워드

논문 기본 정보

이 논문의 연구 히스토리 (2)

초록· 키워드

목차

최근 본 자료

댓글(0)