A Performance Comparison of NLP Text Pre-Processing Techniques for Analysing Personality Based on Myers Briggs Type Indicator (MBTI)
Myers Briggs Type Indicator (MBTI) is well-known instrument for personality evaluation and is frequently being employed in the areas of personal development, career counselling or team building. It categorised people into sixteen personality types as how people expressed themselves to certain traits...
Published in: | 2023 IEEE International Conference on Computing, ICOCO 2023 |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
Institute of Electrical and Electronics Engineers Inc.
2023
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85184854739&doi=10.1109%2fICOCO59262.2023.10397603&partnerID=40&md5=ec24db640f9954adcc6cc73307651d1b |
id |
2-s2.0-85184854739 |
---|---|
spelling |
2-s2.0-85184854739 Rauf M.H.A.; Johan E.J.; Aziz Z.A. A Performance Comparison of NLP Text Pre-Processing Techniques for Analysing Personality Based on Myers Briggs Type Indicator (MBTI) 2023 2023 IEEE International Conference on Computing, ICOCO 2023 10.1109/ICOCO59262.2023.10397603 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85184854739&doi=10.1109%2fICOCO59262.2023.10397603&partnerID=40&md5=ec24db640f9954adcc6cc73307651d1b Myers Briggs Type Indicator (MBTI) is well-known instrument for personality evaluation and is frequently being employed in the areas of personal development, career counselling or team building. It categorised people into sixteen personality types as how people expressed themselves to certain traits such as introversion, extroversion, reasoning against feeling, sense and intuition. The main motivation for this work is to predict the classification of these personalities with the usage of machine learning and natural language processing (NLP) thus the goal of this work is to analyse and compare the performance of three NLP text pre-processing features to classify the MBTI personalities in order to increase the accuracy of the prediction model. Bag of Words (BoW), Words Embedding (Word2Vec) and Transfer Learning with Bidirectional Encoder Representation from Transformer (BERT) are methods being compared for their performances in this work. Results shown that transfer learning (BERT) has the highest accuracy followed by word embedding (Word2Vec), and lastly the BoW. © 2023 IEEE. Institute of Electrical and Electronics Engineers Inc. English Conference paper |
author |
Rauf M.H.A.; Johan E.J.; Aziz Z.A. |
spellingShingle |
Rauf M.H.A.; Johan E.J.; Aziz Z.A. A Performance Comparison of NLP Text Pre-Processing Techniques for Analysing Personality Based on Myers Briggs Type Indicator (MBTI) |
author_facet |
Rauf M.H.A.; Johan E.J.; Aziz Z.A. |
author_sort |
Rauf M.H.A.; Johan E.J.; Aziz Z.A. |
title |
A Performance Comparison of NLP Text Pre-Processing Techniques for Analysing Personality Based on Myers Briggs Type Indicator (MBTI) |
title_short |
A Performance Comparison of NLP Text Pre-Processing Techniques for Analysing Personality Based on Myers Briggs Type Indicator (MBTI) |
title_full |
A Performance Comparison of NLP Text Pre-Processing Techniques for Analysing Personality Based on Myers Briggs Type Indicator (MBTI) |
title_fullStr |
A Performance Comparison of NLP Text Pre-Processing Techniques for Analysing Personality Based on Myers Briggs Type Indicator (MBTI) |
title_full_unstemmed |
A Performance Comparison of NLP Text Pre-Processing Techniques for Analysing Personality Based on Myers Briggs Type Indicator (MBTI) |
title_sort |
A Performance Comparison of NLP Text Pre-Processing Techniques for Analysing Personality Based on Myers Briggs Type Indicator (MBTI) |
publishDate |
2023 |
container_title |
2023 IEEE International Conference on Computing, ICOCO 2023 |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1109/ICOCO59262.2023.10397603 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85184854739&doi=10.1109%2fICOCO59262.2023.10397603&partnerID=40&md5=ec24db640f9954adcc6cc73307651d1b |
description |
Myers Briggs Type Indicator (MBTI) is well-known instrument for personality evaluation and is frequently being employed in the areas of personal development, career counselling or team building. It categorised people into sixteen personality types as how people expressed themselves to certain traits such as introversion, extroversion, reasoning against feeling, sense and intuition. The main motivation for this work is to predict the classification of these personalities with the usage of machine learning and natural language processing (NLP) thus the goal of this work is to analyse and compare the performance of three NLP text pre-processing features to classify the MBTI personalities in order to increase the accuracy of the prediction model. Bag of Words (BoW), Words Embedding (Word2Vec) and Transfer Learning with Bidirectional Encoder Representation from Transformer (BERT) are methods being compared for their performances in this work. Results shown that transfer learning (BERT) has the highest accuracy followed by word embedding (Word2Vec), and lastly the BoW. © 2023 IEEE. |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
issn |
|
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677780047101952 |