
A sample dataset of over 1000 Twitter (X) posts, extracted using the Bright Data API, ideal for trend discovery, brand monitoring, and competitive insights.

luminati-io/Twitter-X-dataset-samples

Twitter-dataset-samples

A sample dataset of 1001 Twitter posts

A sample Twitter dataset of over 1,000 posts, extracted using the Bright Data API. Each post record includes the following fields:

  • id: The unique identifier for the post
  • user_posted: The username of the post owner
  • name: The name of the post owner
  • description: The text description of the post
  • date_posted: The date when the post was published
  • photos: URLs of any photos attached to the post
  • videos: URLs of any videos attached to the post
  • url: The URL link to the post
  • quoted_post: Details of the quoted post within the main post
  • tagged_users: A list of profiles tagged in the post
  • replies: The total number of replies the post has received
  • reposts: The total number of reposts the post has received
  • likes: The total number of likes the post has received
  • views: The total number of views the post has received
  • external_url: The external URL included in the post
  • hashtags: The hashtags included in the post
  • followers: The number of followers the profile has
  • biography: The bio of the post owner
  • posts_count: The total number of posts the profile has made
  • profile_image_link: The URL to the profile image
  • following: The number of profiles the user follows
  • is_verified: Indicates whether the user is verified (True/False)
  • quotes: The total number of times the post has been quoted
  • bookmarks: The total number of times the post has been bookmarked
  • parent_post_details: Details of the parent post, if applicable

The dataset includes many more fields beyond those listed above.
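As a rough illustration of how the fields above might be consumed, here is a minimal Python sketch that turns one raw record into a typed summary. The field names come from the list above; everything else is an assumption made for illustration, including the `PostSummary` class, the `summarize` and `to_int` helpers, and the idea that counts may arrive as strings or be missing (as they can in a CSV export).

```python
from dataclasses import dataclass
from typing import Any, Mapping


def to_int(value: Any) -> int:
    """Coerce a count to int, treating missing or non-numeric values as 0."""
    try:
        return int(float(value))
    except (TypeError, ValueError, OverflowError):
        return 0


@dataclass(frozen=True)
class PostSummary:
    """Hypothetical typed view over a few of the documented fields."""
    post_id: str
    user_posted: str
    is_verified: bool
    replies: int
    reposts: int
    likes: int
    views: int
    quotes: int
    bookmarks: int

    @property
    def interactions(self) -> int:
        # Sum of every documented interaction counter except views.
        return (self.replies + self.reposts + self.likes
                + self.quotes + self.bookmarks)


def summarize(record: Mapping[str, Any]) -> PostSummary:
    """Build a PostSummary from one raw record (a dict keyed by field name)."""
    return PostSummary(
        post_id=str(record.get("id", "")),
        user_posted=str(record.get("user_posted", "")),
        # Assumption: booleans may be real bools or the strings "True"/"False".
        is_verified=str(record.get("is_verified", "")).strip().lower() == "true",
        replies=to_int(record.get("replies")),
        reposts=to_int(record.get("reposts")),
        likes=to_int(record.get("likes")),
        views=to_int(record.get("views")),
        quotes=to_int(record.get("quotes")),
        bookmarks=to_int(record.get("bookmarks")),
    )
```

Keeping `id` as a string is a deliberate choice here: post identifiers can be long enough to lose precision if they pass through a floating-point type.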

This sample is a subset of the "Twitter Posts (public data)" dataset, which includes more than 1,000,000 posts.

Available dataset file formats: JSON, NDJSON, JSON Lines, CSV, or Parquet. Files can optionally be compressed as .gz.
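The text-based formats above can be read with the Python standard library alone. The sketch below is one possible loader, not an official one: the `iter_records` function is hypothetical, the format is guessed from the file extension, and a `.json` file is assumed to hold a top-level array of records. Parquet is left out because it needs a third-party library.

```python
import csv
import gzip
import json
from pathlib import Path
from typing import Iterator


def iter_records(path) -> Iterator[dict]:
    """Yield one dict per post from a CSV, JSON, NDJSON, or JSON Lines file.

    A trailing .gz extension is handled transparently.
    """
    path = Path(path)
    name = path.name.lower()
    compressed = name.endswith(".gz")
    if compressed:
        name = name[:-3]          # drop ".gz" to expose the real format
    kind = Path(name).suffix      # e.g. ".csv", ".ndjson"

    opener = gzip.open if compressed else open
    # newline="" is what the csv module expects; it is harmless for JSON.
    with opener(path, "rt", encoding="utf-8", newline="") as handle:
        if kind == ".csv":
            yield from csv.DictReader(handle)
        elif kind in (".ndjson", ".jsonl"):
            for line in handle:
                if line.strip():  # skip blank lines
                    yield json.loads(line)
        elif kind == ".json":
            # Assumption: the file is a single JSON array of records.
            yield from json.load(handle)
        else:
            raise ValueError(f"Unsupported file type: {path.name}")
```

Note that CSV rows come back with every value as a string, while the JSON-based formats preserve numbers and nested values, so downstream code should not assume a particular type for counts.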

Dataset delivery type options: Email, API download, Webhook, Amazon S3, Google Cloud storage, Google Cloud PubSub, Microsoft Azure, Snowflake, SFTP.

Update frequency: Once, Daily, Weekly, Monthly, Quarterly, or Custom basis.

Data enrichment beyond the extracted data points: available on request.

Get the full Twitter dataset.

What are the use cases for Twitter datasets?

1. Trend Discovery

Uncover emerging trends and opportunities by tracking public conversations on Twitter. Monitor retweets, likes, replies, and mentions to identify key topics and shifts in user sentiment. Use the Twitter dataset to gain valuable insights into customer opinions and evolving market trends.
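One simple way to start on trend discovery is to count how many posts use each hashtag per day. The sketch below assumes that `hashtags` is either a list, a JSON-encoded list, or a comma-separated string, and that `date_posted` is an ISO 8601 timestamp whose first ten characters are the date. Neither format is confirmed by this README, and the function names are hypothetical.

```python
import json
from collections import Counter, defaultdict
from typing import Any, Iterable, Mapping


def normalize_hashtags(value: Any) -> list[str]:
    """Return lowercase hashtags without '#', whatever the input shape."""
    if isinstance(value, str):
        text = value.strip()
        if text.startswith("["):          # looks like a JSON-encoded list
            try:
                value = json.loads(text)
            except json.JSONDecodeError:
                return []
        else:                             # fall back to comma-separated text
            value = text.split(",")
    if not isinstance(value, (list, tuple)):
        return []
    tags = (str(tag).strip().lstrip("#").lower() for tag in value)
    return [tag for tag in tags if tag]


def hashtag_counts_by_day(records: Iterable[Mapping[str, Any]]) -> dict:
    """Map each day (YYYY-MM-DD) to a Counter of posts per hashtag."""
    counts = defaultdict(Counter)
    for record in records:
        day = str(record.get("date_posted") or "")[:10]
        if not day:
            continue
        # A set counts each hashtag at most once per post.
        counts[day].update(set(normalize_hashtags(record.get("hashtags"))))
    return dict(counts)


def top_hashtags(records: Iterable[Mapping[str, Any]], n: int = 10) -> list:
    """Return the n hashtags that appear in the most posts overall."""
    totals = Counter()
    for record in records:
        totals.update(set(normalize_hashtags(record.get("hashtags"))))
    return totals.most_common(n)
```

Comparing a hashtag's daily counts over consecutive days gives a crude signal of whether a topic is rising or fading; a sample of roughly 1,000 posts is likely too small for firm conclusions.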

2. Brand Monitoring

Understand public sentiment about your brand, product, or service by analyzing Twitter data. Track changes in popularity through likes, comments, shares, hashtags, and mentions to stay ahead of trends.
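A basic starting point for brand monitoring is to track how often a brand term appears in post text and how much interaction those posts receive each day. The sketch below does only that; it is a keyword match, not sentiment analysis. The `mentions_by_day` function and the brand term passed to it are hypothetical, and `date_posted` is assumed to start with a `YYYY-MM-DD` date.

```python
from collections import defaultdict
from typing import Any, Iterable, Mapping


def to_int(value: Any) -> int:
    """Coerce a count to int, treating missing or non-numeric values as 0."""
    try:
        return int(float(value))
    except (TypeError, ValueError, OverflowError):
        return 0


def mentions_by_day(records: Iterable[Mapping[str, Any]], term: str) -> dict:
    """Count posts whose description contains `term`, grouped by day.

    Returns {day: {"posts": int, "interactions": int}} sorted by day, where
    interactions is the sum of likes, reposts, and replies.
    """
    needle = term.strip().casefold()
    if not needle:
        raise ValueError("term must not be empty")

    daily = defaultdict(lambda: {"posts": 0, "interactions": 0})
    for record in records:
        text = str(record.get("description") or "")
        if needle not in text.casefold():   # case-insensitive substring match
            continue
        day = str(record.get("date_posted") or "")[:10] or "unknown"
        bucket = daily[day]
        bucket["posts"] += 1
        bucket["interactions"] += sum(
            to_int(record.get(key)) for key in ("likes", "reposts", "replies")
        )
    return dict(sorted(daily.items()))
```

A substring match will also catch unrelated words that contain the term, so a real pipeline would probably want word-boundary matching and a check of the `hashtags` and `tagged_users` fields as well.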

3. Competitive Market Insights

Gain a competitive edge by assessing the social media activity of rival brands. Review hashtags, posts, and user engagement on Twitter to refine your strategy and outperform competitors.
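To compare rival accounts, one option is to aggregate interactions and views per `user_posted` value and derive an interaction rate. The following sketch is only one possible definition of that rate (likes plus reposts plus replies, divided by views); the `engagement_by_account` function and the account names in the usage example are hypothetical.

```python
from typing import Any, Iterable, Mapping


def to_int(value: Any) -> int:
    """Coerce a count to int, treating missing or non-numeric values as 0."""
    try:
        return int(float(value))
    except (TypeError, ValueError, OverflowError):
        return 0


def engagement_by_account(records: Iterable[Mapping[str, Any]],
                          accounts: Iterable[str]) -> dict:
    """Summarize posts, interactions, views, and rate for selected accounts.

    The rate is interactions / views, or None when no views were recorded.
    """
    wanted = {name.casefold().lstrip("@") for name in accounts}
    totals = {}
    for record in records:
        user = str(record.get("user_posted") or "").casefold()
        if user not in wanted:
            continue
        entry = totals.setdefault(
            user, {"posts": 0, "interactions": 0, "views": 0}
        )
        entry["posts"] += 1
        entry["interactions"] += sum(
            to_int(record.get(key)) for key in ("likes", "reposts", "replies")
        )
        entry["views"] += to_int(record.get("views"))

    summary = {}
    for user, entry in totals.items():
        rate = entry["interactions"] / entry["views"] if entry["views"] else None
        summary[user] = {**entry, "rate": rate}
    return summary
```

For example, `engagement_by_account(records, ["@brand_a", "@brand_b"])` would return one entry per account that has posts in `records`. Dividing by views rather than by `followers` is a design choice: it normalizes by actual reach of each post, but either denominator could be reasonable depending on the question.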

Free access to web scraping tools and datasets for academic researchers and NGOs

The Bright Initiative offers access to Bright Data's Web Scraper APIs and ready-to-use datasets to leading academic faculties and researchers, as well as to NGOs and NPOs promoting environmental and social causes. You can submit an application here.
