본문 바로가기

NLP

NLP overview

*사용된 모든 영문 image의 출처는 cs231n 강의 자료입니다.*

 

<NLP overview>

    1. NLP Applications

    2. Academic Disciplines related to NLP

    3. Trends of NLP


NLP Applications

 

 

NLP(Natural Language Processing)

  • 많은 양의 natural language datafmf computer program process에 적용시키는 방법
  • NLP goal: Language의 contextual nuances를 포함하여 document의 contents를 computer가 'understanding'하게 하는 것

 

 

NLP  Applications

  • Text Classification: Spam Detection (INBOX - SPAM FOLDER), Sentiment Analysis ···
  • Question Answering: Search Engine (Question에 대한 이해) ···
  • Machine Translation (Source language → Target language)
  • Chatbot
  • Personal Assistant (Sound → Natural Language → Command)
  • Text Summarization (Input document → Abstractive summarization / Extractive Summarization)

Academic Disciplines related to NLP

 

 

Natural Language processing

  • Major conferences: ACL, EMNLP, NAACL

- Low-level parsing: Tokenization, stemming

- Word and phrase level: Named Entity Recognition, Part-Of-Speech, Semantic relation extraction

- Sentence level: Sentiment analysis, Machine trainslation

- Multi-sentence and paragraph level: Entailment prediction, Question answering, Dialog systems (chatbot), Summarization

 

Text mining

  • Major conferences: KDD, The WebConf, WSDM, CIKM, ICWSM

- Text와 document data에서 유용한 insight 추출

- Document clustering (e.g. topic modeling)

- Computational Social Science: sns에서 사람들의 사회, 과학적인 insight 도출

 

Information retrieval

  • Major conferences: SIGIR, WSDM, CIKM, RecSys

- Recommendation system


Trends of NLP

 

 

Trends of NLP

  • Text data는 sequence of words로
  • 각 word는 embedding vector로 표현 (e.g. Word2Vec, Glove)
  • Seqeuntial data를 RNN, LSTM, GRU 등의 RNN-based model로 분석
  • Attention과 Transformer 모델 등장으로 RNN은 self-attention으로 대체
  • 최근에는 Self-Supervised model (No label)을 사용한 BERT 등의 model이 large dataset을 fine-tuned하여 다양한 downstream task에서 사용

'NLP' 카테고리의 다른 글

Topic Modeling  (0) 2022.06.17
Bag-of-Words  (0) 2022.06.16
NLP 이해하기  (0) 2022.04.07
구글 BERT의 정석 정리(수기)  (0) 2022.04.07
파텍마 정리 사진(수기)  (0) 2022.04.07