Academia
Kyle Hamilton from Technological University Dublin collaborated with Label Your Data for her research on rhetorical devices of propaganda used in news articles.
Kyle Hamilton is a PhD Researcher at TU Dublin who focuses on applying neuro-symbolic AI to detect propaganda in news feeds.
Kyle needed a high-quality text dataset containing classified propaganda-related sentences to compare humans and ChatGPT in detecting propaganda in news.
Label Your Data provided skilled annotators with linguistic background to classify and label 357 sentences using the Client’s platform.
Despite inconsistencies in human annotations, they prove more reliable in propaganda analysis due to ChatGPT’s lack of real-world knowledge.
Kyle Hamilton is a PhD Researcher at Technological University Dublin. Her research work is centered around neuro-symbolic AI for detecting propaganda in news articles.
Holding a Master’s in Information and Data Science and a Bachelor of Fine Art, Kyle brings a unique interdisciplinary approach to her work.
Most efforts to automate the detection of propaganda and misinformation are mainly focused on using natural language processing (NLP).
Following the same idea, Kyle aimed to create a tool that helps identify propaganda in news. But first, she wanted to compare ChatGPT and human linguists in performing the same task.
Exploring the possibilities of using ChatGPT for automated propaganda detection.
Hiring an expert annotation team with domain experts to compare human annotations to ChatGPT’s responses.
Getting high-quality annotated text dataset for the research.
Dealing with the subjective nature of the sentence classification task.
Ratio of partial agreement among all three annotators for each feature
Chat GPT Agreement among itself when prompted 3 times
Verb choices
Tropes
Tense
Subject choices
Series
Sentence architecture
Prosody and punctuation
Predication
Phrases built on verbs
Phrases built on nouns
Parallelism
New words and changing uses
Mood
Modifying phrases
Modifying clauses
Lexical and semantic fields
Language varieties
Language of origin
Figures of word choice
Figures of argument
Emphasis
Aspect
Hiring 3 dedicated data annotators with linguistic background for classification of 357 sentences.
Working in a flexible mode to seamlessly integrate into Kyle's annotation platform.
Conducting a cross-reference QA to address the project’s subjectivity.
Delivering high-quality annotated text corpus.
This initial phase aimed at supporting Kyle Hamilton’s research and securing the grant. While findings are initial, Label Your Data expects a larger dataset to annotate.
Though human experts’ annotations vary, their real-world knowledge surpasses AI models like ChatGPT, which solely relies on internet-derived data.
Analyzed the discrepancies in agreement between annotators and ChatGPT.
Identified the most challenging areas to achieve consensus:
Despite inconsistencies, defined human annotators as more reliable for propaganda analysis.
Delivering high-quality annotated text corpus.
PhD Researcher at TU Dublin
Check our performance based on a free trial
Pay per labeled object or per labeling hour
Working with every labeling tool, even your custom tools
Work with a data-certified vendor: PCI DSS Level 1, ISO:2700, GDPR, CCPA