Education

Seoul National University

Ph.D. in Industrial Engineering (공학박사) • August 2023

  • Majored in Data Mining
  • Advised by Professor Sungzoon Cho at Big Data AI Laboratory
  • Dissertation: "Financial Risk Assessment Automation: Hot Topic Detection in Speeches, Sentiment Analysis of News Articles, and Spam Filtering on Twitter"
    (금융위험 평가 자동화: 연설문 핫토픽 탐지, 뉴스 감성 분석 및 트위터 스팸 필터링)

Korea University

Bachelor of Business Administration • February 2016

Work Experience

Staff Engineer, Samsung Electronics (Mechatronics Research Center)

September 2023 – Present

Analyzing "equipment maintenance/operation texutal records (written by hardware engineers)" and "sensor data" for root cause analysis (RCA) of equipment failures

  • Programming via Python, JavaScript, Spotfire Expression, and other tools.

Software Developer, Bankware Global

July 2016 – June 2018

Worked as a backend software developer using Java, Linux, and Oracle.

  • Fixed software bugs related to thread that calculates java byte code coverage data and other statistical information.
  • Developed backend process of the OK Bae-Jung Foundation scholarship website.

Market Researcher, Hankook Research

January 2016 – May 2016

Analyzed user-generated data mainly obtained from surveys.

  • Conducted group surveys and home visiting interviews in the Los Angeles, United States, to create a report on built-in home appliances.

Publications

(*: Corresponding author; †: Equal contribution)

Journals

Automatic Construction of Direction-aware Sentiment Lexicon Using Direction-dependent Words

Jihye Park, Hye Jin Lee, Sungzoon Cho* (2024), "Automatic Construction of Direction-aware Sentiment Lexicon Using Direction-dependent Words," Accepted for Publication to Language Resources and Evaluation.
● The first version of this manuscript previously uploaded to arXiv—a repository that facilitates the free distribution of scholarly work—has been cited four times by other research works.

Incorporation of Company-Related Factual Knowledge into Pre-trained Language Models for Stock-Related Spam Tweet Filtering []

Jihye Park, Sungzoon Cho* (2023), "Incorporation of Company-Related Factual Knowledge into Pre-trained Language Models for Stock-Related Spam Tweet Filtering," Expert Systems with Applications, Volume 234, 30 December 2023, 121021.
● ESWA has 2022 Impact Factor of 8.5, based on the Journal Citation Reports by Clarivate Analytics (released in June 2023).
[Data] [Code]

Hot Topic Detection in Central Bankers' Speeches []

Jihye Park, Hye Jin Lee, Sungzoon Cho* (2023), "Hot Topic Detection in Central Bankers' Speeches," Expert Systems with Applications, Volume 230, 15 November 2023, 120563.
● ESWA has 2021 Impact Factor of 8.665, based on the Journal Citation Reports by Clarivate Analytics (released in June 2022).
[Data] [Code]

Conferences

PacMAP을 이용한 반도체 제조 공정 데이터에서의 이상 탐지 [View]

유용재, 최진우, 박지혜, 유하늘, 박광균, 임재용, 박현섭, 윤동운, 김수정, 조성민, 금의석, 조성준*
Fall Conference of Korean Institute of Industrial Engineers (KIIE), UNIST, Korea, 2023.

Post-training Approach Using Form 10-K Filings to Inject Financial Domain-specific Factual Knowledge into Pretrained Language Models for Data-driven Financial Spam Filtering

박지혜, 조성준*
Summer Conference of Korea Data Mining Society (KDMS), Gangneung, Korea, 2023.

반도체 제조 도메인 특화 지식 그래프 구축을 위한 엔티티 추출 방법 연구 [View]

박지혜, 김현종, 박소형, 조성준*, 전성환
Spring Conference of Korean Institute of Industrial Engineers (KIIE), Jeju, Korea, 2023.

Twitter Embedding for Covariance Matrix Estimation & Portfolio Optimization

안재관, 박지혜, 유용재, 조성준*, 김규진
Summer Conference of Korea Data Mining Society (KDMS), Busan, Korea, 2022.

UMAP과 HDBSCAN 군집화 기반의 반도체 제조 공정 데이터 대상 레이블 전파 방법 [View]

박지혜, 최진우, 유용재, 정수호, 인창현, 박현섭, 조기윤, 윤동운, 김상혁, 조윤혁, 이은영, 이승호, 금의석, 조성준*
Spring Conference of Korean Institute of Industrial Engineers (KIIE), Jeju, Korea, 2022.

Automatic Keyword Extraction and NER-Normalization Based Knowledge Network Construction for Patent Summarization [View]

Hye Jin Lee, Sung Whan Jeon, Jihye Park, Sungzoon Cho*
Spring Conference of Korean Institute of Industrial Engineers (KIIE), Jeju, Korea, 2021.

A frequency- and ranking-based approach for new event detection using non-contiguous bigrams from text data of the central bankers’ speeches

Jihye Park, Hye Jin Lee, Sungzoon Cho*
Fall Conference of Korea Data Mining Society (KDMS), Seoul, Korea, 2019.

InfoAnoGAN: An Interpretable Anomaly Detection Method Using InfoGAN based on Unsupervised Learning

Jihye Park, Sungzoon Cho*
Spring Conference of Korea Data Mining Society (KDMS), Seoul, Korea, 2019.

Teaching

Lecturer (시간강사), Chung-Ang University, Department of Advertising and Public Relations, "Digital Analytics" Course

Spring 2023

중앙대학교 경영경제대학 광고홍보학과 전공선택 과목 "디지털 애널리틱스"
[]

Teaching Assistant, NH Nonghyup Bank (NH농협은행) "4th Industrial Revolution Core Talent Cultivation" Course

Summer 2022, Summer 2023

Provided technical support for big data analysis program for NH Nonghyup Bank employees.

Teaching Assistant, LG U+ (엘지유플러스) "Dream Big Data" Course

Fall 2021

Provided technical support for Digital Transformation (DX) training program for LG U+ employees.

Lecturer, SNU Language Education Institute (서울대학교 언어교육원)

November 2019 – February 2021

Presented lectures on python programming and machine learning for government-invited Malaysian students.

Projects

Information Extraction, Data Labeling, and Dataset Shift Adaptation Algorithms for Big Data Analysis Model Construction

빅데이터 분석 모델 구축을 위한 정보 추출, 데이터 레이블링, 데이터셋 변화 적응 알고리즘 연구

National Research Foundation of Korea (한국연구재단)
September 2021 – August 2023
Scraped domain-specific textual data such as FOMC Statements and FOMC Minutes.

Research on building knowledge graph in the semiconductor domain

반도체 분야 지식 그래프 구축 연구

Samsung Advanced Institute of Technology (삼성전자 종합기술원)
February 2023 – June 2023

Research on deep learning based robust techniques for equipment anomaly detection

딥러닝을 활용한 강건한 설비 이상 탐지 기법 연구

Samsung Electronics (삼성전자 설비기술연구소)
December 2020 – June 2023

Text mining based hot topic detection from social media data

소셜미디어 텍스트 분석을 통한 투자 관련 핫토픽 탐지

NH Investment & Securities (NH투자증권)
September 2021 – December 2021

Development of automatic classification system for documents using artificial intelligence technology

인공지능 기술을 통한 문서 분류 자동화 시스템 구축

Samsung Electronics (삼성전자 혁신센터)
June 2019 – March 2021

A Little More About Me

arXiv

Music-Circles: Can Music Be Represented With Numbers?

Seokgi Kim†, Jihye Park†, Kihong Seong†, Namwoo Cho†, Junho Min†, Hwajung Hong*
arXiv preprint arXiv:2102.13350, 2021.
[Media Release] [Paper] [Demo] [Video] [Code]