Skip to content
View yangheng95's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report yangheng95

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yangheng95/README.MD

Hi there!

I am a PhD student at the University of Exeter, specializing in genomics, bioinformatics, and Large Language Models (LLMs). My research includes biological sequence modeling, sentiment analysis, adversarial attacks, and open-source tool development. I’ve contributed to the field with OmniGenomeBench and PyABSA, and have published in leading conferences like ACL, EMNLP, CIKM, EACL, IEEE TSE, etc.

I actively contribute to platforms like GitHub and Hugging Face, sharing tools like FindFile, MetricVisualizer, BoostAug. I’m committed to advancing AI and NLP while making these technologies accessible through open-source contributions.

I expect to graduate in 2025 and am open to new opportunities in academia and industry.


Publications


GitHub Projects

A large-scale in-silico benchmarking framework for genomic foundation models (GFMs). It addresses the lack of standardized tools for evaluating GFMs, automating the benchmarking process for diverse models. Features a public leaderboard for tracking performance across models.

  • GitHub stars
  • PePy Downloads

A modularized framework for Aspect-Based Sentiment Analysis (ABSA). PyABSA simplifies sentiment analysis tasks with pre-trained models and datasets for research and production environments.

  • GitHub stars
  • PePy Downloads

Hugging Face Models

  • DeBERTa-v3 Base ABSA
    A model for aspect-based sentiment analysis (ABSA), trained with over 30k samples for tasks like sentiment classification.

  • PlantRNA-FM
    An interpretable RNA foundation model for exploring functional RNA motifs in plants. Pre-trained on data from over 1,124 plant species.

  • OmniGenome
    A genomic model aimed at biological sequence modeling, part of the OmniGenome project.


Community Contribution

yangheng95 GitHub Stats

Profile Views: Profile Views

Pinned Loading

  1. OmniGenBench OmniGenBench Public

    Provide RNA and DNA Foundation Model Benchmarks and Applications

    Python 9 1

  2. PyABSA PyABSA Public

    Sentiment Analysis, Text Classification, Text Augmentation, Text Adversarial defense, etc.;

    Jupyter Notebook 952 161

  3. ABSADatasets ABSADatasets Public

    Public & Community-shared datasets for Aspect-based sentiment analysis and Text Classification

    HTML 205 64

  4. SuperResolutionAnimeDiffusion SuperResolutionAnimeDiffusion Public

    Super Resolution Anime Diffusion, waifu2x

    Python 36 5

  5. BoostTextAugmentation BoostTextAugmentation Public

    Python 15

  6. InstOptima InstOptima Public

    This repo is for our EMNLP2023 short paper (Findings): InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators.

    Python 11 3