Work Experience

Samsung Electronics

September 2023 – Present

Working on developing robust and explainable algorithms that can support engineers' decisions with visualized, quantitative, and objective evidence.

  • Mainly focusing on time-series and textual data.

Backend Developer, Bankware Global

July 2016 – June 2018

Worked as a backend developer using Java, Linux, and Oracle.

  • Worked as a member of the "infra solution team."
  • Developed backend process of the OK Bae-Jung Foundation scholarship website, communicating with frontend developers and UI/UX designers.

Market Researcher, Hankook Research

January 2016 – May 2016

Analyzed user-generated data mainly obtained from surveys.

  • Conducted group surveys and home visiting interviews in the Los Angeles, United States, to create a report on built-in home appliances.
  • Conducted several interviews with experts in the digital signage field.

Education

Seoul National University

Ph.D. in Industrial Engineering (공학박사) • August 2023

  • Majored in Data Mining
  • Advised by Professor Sungzoon Cho at Big Data AI Center
  • Dissertation: "Financial Risk Assessment Automation: Hot Topic Detection in Speeches, Sentiment Analysis of News Articles, and Spam Filtering on Twitter"
    (금융위험 평가 자동화: 연설문 핫토픽 탐지, 뉴스 감성 분석 및 트위터 스팸 필터링)

Korea University

Bachelor of Business Administration • February 2016

  • Internship at Linewalks (2014): Suggested ideas for a new service utilizing healthcare big data and designed marketing promotion materials.
  • Internship at Hankook Tire Hungary Ltd. (2016): Worked with Hungarian colleagues dealing with EU taxes, communicating in English.

Patents

센서 관련 지식이 강화된 언어 모델을 활용한 자동 센서명 분류 방법

출원번호: P20240102576 (2024.08.01)

차원 압축 도메인에서의 가이드라인을 이용한 Biplot 분석 방법

출원번호: P20240110061 (2024.08.16)

차원 축소 알고리즘과 군집 분석 평가 지표를 활용한 TTTM(Tool-to-Tool-Matching) 활동 관리 지수

출원번호: P20240116940 (2024.08.29)

주성분의 특징을 이용한 대량 반도체 웨이퍼의 광학스펙트럼 전 파장 데이터 비교분석 방법

출원번호: P20240144666 (2024.10.22)

Conference

센서 관련 지식이 강화된 언어 모델을 활용한 센서명 자동 분류 방법

박지혜, 최민훈, 윤동운, 유하늘, 임재용, 최인수, 김수정, 김재혁, 정지수, 조성민, 금의석
Fall Conference of Korean Institute of Industrial Engineers (KIIE), Seoul, Korea, 2024.

Machine Learning 기반 반도체 공정/설비 불량 원인 분석 방법론 [View]

유하늘, 박현섭, 임재용, 김수정, 윤동운, 박지혜, 조성민, 금의석
Spring Conference of Korean Institute of Industrial Engineers (KIIE), Yeosu, Korea, 2024.

PacMAP을 이용한 반도체 제조 공정 데이터에서의 이상 탐지 [View]

유용재, 최진우, 박지혜, 유하늘, 박광균, 임재용, 박현섭, 윤동운, 김수정, 조성민, 금의석, 조성준*
Fall Conference of Korean Institute of Industrial Engineers (KIIE), UNIST, Korea, 2023.

Post-training Approach Using Form 10-K Filings to Inject Financial Domain-specific Factual Knowledge into Pretrained Language Models for Data-driven Financial Spam Filtering

박지혜, 조성준*
Summer Conference of Korea Data Mining Society (KDMS), Gangneung, Korea, 2023.

반도체 제조 도메인 특화 지식 그래프 구축을 위한 엔티티 추출 방법 연구 [View]

박지혜, 김현종, 박소형, 조성준*, 전성환
Spring Conference of Korean Institute of Industrial Engineers (KIIE), Jeju, Korea, 2023.

Twitter Embedding for Covariance Matrix Estimation & Portfolio Optimization

안재관, 박지혜, 유용재, 조성준*, 김규진
Summer Conference of Korea Data Mining Society (KDMS), Busan, Korea, 2022.

UMAP과 HDBSCAN 군집화 기반의 반도체 제조 공정 데이터 대상 레이블 전파 방법 [View]

박지혜, 최진우, 유용재, 정수호, 인창현, 박현섭, 조기윤, 윤동운, 김상혁, 조윤혁, 이은영, 이승호, 금의석, 조성준*
Spring Conference of Korean Institute of Industrial Engineers (KIIE), Jeju, Korea, 2022.

Automatic Keyword Extraction and NER-Normalization Based Knowledge Network Construction for Patent Summarization [View]

Hye Jin Lee, Sung Whan Jeon, Jihye Park, Sungzoon Cho*
Spring Conference of Korean Institute of Industrial Engineers (KIIE), Jeju, Korea, 2021.

A frequency- and ranking-based approach for new event detection using non-contiguous bigrams from text data of the central bankers’ speeches

Jihye Park, Hye Jin Lee, Sungzoon Cho*
Fall Conference of Korea Data Mining Society (KDMS), Seoul, Korea, 2019.

InfoAnoGAN: An Interpretable Anomaly Detection Method Using InfoGAN based on Unsupervised Learning

Jihye Park, Sungzoon Cho*
Spring Conference of Korea Data Mining Society (KDMS), Seoul, Korea, 2019.

Publications

(*: Corresponding author; †: Equal contribution)

Journals

Automatic Construction of Direction-aware Sentiment Lexicon Using Direction-dependent Words []

Jihye Park, Hye Jin Lee, Sungzoon Cho* (2024), "Automatic Construction of Direction-aware Sentiment Lexicon Using Direction-dependent Words," Language Resources and Evaluation, Open access, 25 May 2024, 1-27.

Incorporation of Company-Related Factual Knowledge into Pre-trained Language Models for Stock-Related Spam Tweet Filtering []

Jihye Park, Sungzoon Cho* (2023), "Incorporation of Company-Related Factual Knowledge into Pre-trained Language Models for Stock-Related Spam Tweet Filtering," Expert Systems with Applications, Volume 234, 30 December 2023, 121021.
● ESWA has 2022 Impact Factor of 8.5, based on the Journal Citation Reports by Clarivate Analytics (released in June 2023).
[Data] [Code]

Hot Topic Detection in Central Bankers' Speeches []

Jihye Park, Hye Jin Lee, Sungzoon Cho* (2023), "Hot Topic Detection in Central Bankers' Speeches," Expert Systems with Applications, Volume 230, 15 November 2023, 120563.
● ESWA has 2021 Impact Factor of 8.665, based on the Journal Citation Reports by Clarivate Analytics (released in June 2022).
[Data] [Code]

(Unofficial) Other Activities

가짜뉴스 문제 해결을 위한 범국민 SNS 캠페인 제안

윤서윤 (중앙대학교 광고홍보학과)†, 조영아 (중앙대학교 도시계획부동산학과)†, 최영현 (중앙대학교 프랑스어문학과)†, 박재홍 (중앙대학교 역사학과)†, 박지혜*
[2023 KOSAC "시민 문화 정착 프로젝트 기획" 공모전 출품]

Music-Circles: Can Music Be Represented With Numbers?

Seokgi Kim†, Jihye Park†, Kihong Seong†, Namwoo Cho†, Junho Min†, Hwajung Hong*
arXiv preprint arXiv:2102.13350, 2021.
[Media Release] [Paper] [Demo] [Video] [Code]

Projects

Information Extraction, Data Labeling, and Dataset Shift Adaptation Algorithms for Big Data Analysis Model Construction

빅데이터 분석 모델 구축을 위한 정보 추출, 데이터 레이블링, 데이터셋 변화 적응 알고리즘 연구

National Research Foundation of Korea (한국연구재단)
September 2021 – August 2023
Scraped domain-specific textual data such as FOMC Statements and FOMC Minutes.

Research on building knowledge graph in the semiconductor domain

반도체 분야 지식 그래프 구축 연구

Samsung Advanced Institute of Technology (삼성전자 종합기술원)
February 2023 – June 2023

Research on deep learning based robust techniques for equipment anomaly detection

딥러닝을 활용한 강건한 설비 이상 탐지 기법 연구

Samsung Electronics (삼성전자 설비기술연구소)
December 2020 – June 2023

Text mining based hot topic detection from social media data

소셜미디어 텍스트 분석을 통한 투자 관련 핫토픽 탐지

NH Investment & Securities (NH투자증권)
September 2021 – December 2021

Development of automatic classification system for documents using artificial intelligence technology

인공지능 기술을 통한 문서 분류 자동화 시스템 구축

Samsung Electronics (삼성전자 혁신센터)
June 2019 – March 2021

Teaching

Lecturer (시간강사), Chung-Ang University, Department of Advertising and Public Relations, "Digital Analytics" Course

Spring 2023

중앙대학교 경영경제대학 광고홍보학과 전공선택 과목 "디지털 애널리틱스"
[]

Teaching Assistant, NH Nonghyup Bank (NH농협은행) "4th Industrial Revolution Core Talent Cultivation" Course

Summer 2022, Summer 2023

Provided technical support for big data analysis program for NH Nonghyup Bank employees.

Lecturer, Korea Arts Management Service (예술경영지원센터) "Data Analysis" Course

Fall 2022

Presented lectures on MS Excel-based data analysis techniques.

Lecturer, Grand Korea Leisure (그랜드코리아레저) "Data Analysis" Course

Summer 2022

Presented video lectures on MS Excel-based data analysis techniques.

Lecturer, Kyungbok University (경복대학교) "Basics of Big Data Analysis" Course

Summer 2022

Presented video lectures on MS Excel-based data analysis techniques.

Teaching Assistant, MKYU "Big Data Analysis" Course

Spring 2022

Provided feedbacks on data-driven value creation proposals written by MKYU students.

Teaching Assistant, LG U+ (엘지유플러스) "Dream Big Data" Course

Fall 2021

Provided technical support for Digital Transformation (DX) training program for LG U+ employees.

Lecturer, SNU Language Education Institute (서울대학교 언어교육원)

November 2019 – February 2021

Presented lectures on Python programming and machine learning for government-invited Malaysian students.

Teaching Assistant, SNU Data Mining Camp / SNU Big Data AI Camp

February 2019 – August 2023

Assisted lectures on introduction to data mining for high school students.

A Little More About Me

  • I love music. My favorite artists include 윤하, Taylor Swift, and Chopin. I also enjoy playing the piano. Music always helps me relax.
  • During my USA trip on July 2023, I found out that I liked paintings! I enjoy the calm and vibrant atmosphere of an art museum :). My favorite artists include Claude Monet and Henri Matisse; I love the colors in their works.
  • I enjoy swimming XD. Just like listening to good music and eating delicious food, working out helps me stay calm, strong, and positive.
  • I tend to be easily moved by warm words. I like people :D.
  • I like to record my thoughts.. and feelings.