Towards a Programmable Humanizing AI through Scalable Stance-Directed Architecture

Welcome to the Stance-Directed Humanizing AI repository. The proposed pipeline aims to reduce the generation of toxic narratives in digital communications by leveraging the power of generative artificial intelligence (AI) fine-tuned on positive human values. Our approach emphasizes the importance of fostering social cohesion and understanding through language, counteracting the spread of harmful content online.

This study introduces a novel pipeline to train Large Language Models (LLM) for generating tweets that are not only relevant to given aspects and entities but also aligned with healthier discourse and constructive sentiments. This pipeline utilizes a toxic content classifier to ensure generated tweets are non-toxic and employs a stance-aware aspect-based sentiment analysis (ABSA) model to extract stances from these tweets, promoting a more civil and humanized interaction on social media platforms where it is demonstrated on contentious real-world Twitter dataset on U.S. race relations.

Key Components

Stance-Directed Tweet Generator: Based on aspects and entities, this model generates tweets that aim to reflect humanized and constructive discourse.
Toxic Content Classifier: This component classifies the generated tweets as toxic or non-toxic, ensuring the promotion of positive engagement.
Stance-Aware ABSA Model: Extracts the stance of the generated tweet towards the specified aspects and entities, facilitating a deep understanding of sentiments.

Datasets

The study incorporates five datasets:

TrainTweetsForHumanizedLLM.csv and TrainTweetsForUnrestrictedLLM.csv: Training data for humanized and unrestricted LLMs respectively.
ToxicClassifierDataset.csv: Training data for the toxic content classifier.
GoldToxicDataset.csv: Golden outputs for evaluating the toxicity classifier's performance, labelled by 3 annotators with a Krippendorff's alpha nominal score of 0.73, indicating a good level of inter-annotator agreement.
GeneratedOutputsWithLabels.csv: The generated tweets using humanized and unrestricted LLMs labelled by 3 annotators and provided with classifier model predictions, indicating a Krippendorff's alpha nominal score of 0.75, further showing a reliable consensus among annotators.

Using This Repository

To simulate our study and see the models in action:

Select Camp, Aspects, and Entities: Begin by specifying the camp along with the aspects and entities you are interested in.
Generate a Tweet: The tweet generator model will produce a tweet based on your input.
View ABSA Outputs: Analyze the fine-grained sentiments and stances extracted from the generated tweet.
Toxic/Non-Toxic Label: Determine whether the generated tweet is considered toxic or non-toxic.

How to Run the App on Huggingface Space or Locally

The study is available as a Huggingface Space and can be accessed here.

To run the app locally, you can use the following command:

git clone https://huggingface.co/spaces/tweetpie/stance-directed-humanizing-ai
cd stance-directed-humanizing-ai
pip install -r requirements.txt

streamlit run app.py

Sample input/output:

Ideology: Left
Pro Entities: ['migrant worker rights groups']
Anti Entities: ['labor exploitation']
Neutral Entities: ['agricultural sector']
Pro Aspects: ['fair treatment', 'safety standards']
Anti Aspects: []
Neutral Aspects: ['employment laws', 'worker visas']

Generated Tweet: "the agricultural sector is the single biggest recipient of migrants workers rights groups argue . nearly 90 % of those who come to the us are denied employment due to discriminatory employment laws and safety standards ."
ABSA Outputs:
    Aspect: migrants, Sentiment: positive
    Aspect: rights, Sentiment: positive
    Aspect: laws, Sentiment: positive
    Aspect: safety, Sentiment: positive 
Toxic/Non-Toxic Label: Non-Toxic

Contributions and Feedback

We encourage contributions and feedback to improve this project. If you have suggestions or want to contribute, please open an issue or pull request on our GitHub repository.

Citation

If you use our work, please cite our paper:

@article{ccetinkaya2024towards,
  title={Towards a Programmable Humanizing AI through Scalable Stance-Directed Architecture},
  author={{\c{C}}etinkaya, Yusuf M{\"u}cahit and Lee, Yeonjung and K{\"u}lah, Emre and Toroslu, {\.I}smail Hakk{\i} and Cowan, Michael A. and Davulcu, Hasan},
  journal={},
  year={2024},
  volume={}
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
data		data
images		images
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Towards a Programmable Humanizing AI through Scalable Stance-Directed Architecture

Key Components

Datasets

Using This Repository

How to Run the App on Huggingface Space or Locally

Contributions and Feedback

Citation

About

Releases

Packages

tweetpie/stance-directed-humanizing-ai

Folders and files

Latest commit

History

Repository files navigation

Towards a Programmable Humanizing AI through Scalable Stance-Directed Architecture

Key Components

Datasets

Using This Repository

How to Run the App on Huggingface Space or Locally

Contributions and Feedback

Citation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages