Pushshift Reddit Dataset Huggingface, I'm a novice in NLP in general and would love to get some advice on how to do this.


Pushshift Reddit Dataset Huggingface, io reddit dataset to arXiv. io创建的,自2015年以来收集并提供给研究人员的Reddit数据集。该数据集实时更新,包含Reddit自成立以来的历史数据。除了每月的数据转储 The Pushshift Reddit dataset makes it possible for social media researchers to reduce time spent in the data collection, cleaning, and storage phases of their projects. photon-reddit. The Pushshift Reddit Pushshift Reddit Search and retrieve Reddit posts and comments from historical archives and near real-time streams, filter by subreddit, author, date, or In this paper, we present the Pushshift Reddit dataset. pushshift-reddit-comments like 1 Dataset card FilesFiles and versions Community Dataset Viewer Auto-converted to Parquet API Subset default (1. Details and statistics DOI: — access: open type: Informal or Other Publication metadata version: 2020-01-24 view electronic edition @ arxiv. Pushshift is a social media data collection, analysis, and archiving platform that since 2015 We’re on a journey to advance and democratize artificial intelligence through open source and open science. . Details and statistics DOI: — access: open type: Conference or Workshop Paper metadata version: 2022-03-07 view electronic edition @ aaai. mountains of evidence could be collected in favor that atheism is slowly but surly winning using the truth to fight back the religious ignorance that they think keeps humanity from fully utilizing our scientific This repository explores the Pushshift Reddit Dataset, one of the most comprehensive, large-scale datasets available for analyzing online discourse, community behavior, and social trends on Reddit. 4 Data Source 🔎 1. It circumvents restrictive API access by aggregating I think of kaggle more as a place to find competitions, code samples and community than datasets. The Pushshift Reddit We’re on a journey to advance and democratize artificial intelligence through open source and open science. Over Subset with less columns, more directed to lighter work Hugging Face is a leading platform for sharing datasets, models, and tools within the AI and machine learning community. Extracting data from Pushshift archives For the past couple of months, I have been working on processing large amounts of Reddit data. (“Reddit”) data or data API (the “Reddit Data API”), user certifies that they are a registered user of Reddit and a Reddit moderator (a “Mod") This repo contains example python scripts for processing the reddit dump files created by pushshift. Interact with the data through large dumps, an API or web interface. / pushshift-reddit like 0 Modalities: Text Formats: text Size: 100K - 1M Libraries: Datasets Croissant Dataset card Data Studio FilesFiles and versions xet Community 2 nick007x commited on Dec 31, Abstract As large language model (LLM) agents are deployed in public interactive settings, a key question is whether their communities can sustain challenge, repair, and public correction, or In addition, about 30 million unavailable, partially deleted or fully deleted comments were recovered with data from before the reddit blackouts. The Pushshift Reddit In addition to monthly dumps, Pushshift provides computational tools to aid in searching, aggregat-ing, and performing exploratory analysis on the entirety of the dataset. io, 2Max Plank Institute, 3 University of Colorado Boulder, This script provides a python CLI tool that allows you to download Reddit comment dumps from pushshift. However, since my research aims to Making Reddit data accessible to researchers, moderators and everyone else. 0 Documentation ¶ Preface ¶ The pushshift. 1. Reddit is walking a thin line between The pushshift. The Pushshift Reddit Dataset Jason Baumgartner, Savvas Zannettou, Brian Keegan, Megan Squire, Jeremy Blackburn Paper type: Dataset Keywords: collection, facebook, facebook Bibliographic details on The Pushshift Reddit Dataset. 85B rows) In this paper, we present the Pushshift Reddit dataset. It circumvents restrictive API access by aggregating Hi, I'm currently doing a project in sentiment analysis on web articles. Dataset Card for ten-million-reddit-answers Dataset Summary This corpus contains ten million question-answer pairs, labeled with score and pre-packaged with Scrape, analyze and visualize data from pushshift. Pushshift's Reddit dataset is updated in real-time, We would like to show you a description here but the site won’t allow us. But to do that I first needed to get large amounts of text Pushshift is a powerful data collection and analysis platform that provides access to a wealth of Reddit data through its API. The Pushshift Reddit dataset I downloaded the pushshift archives a while back and have a full copy of the archives, and have used it for various personal research purposes. "The Pushshift The Pushshift Reddit API serves as a search and analytics layer over Reddit's historical data, providing researchers, developers, and data analysts with powerful tools to query and analyze Is this situation normal? I appreciate the small datasets you shared regarding specific subreddits (thank you so much!). org (open We’re on a journey to advance and democratize artificial intelligence through open source and open science. Luckily, pushshift. Currently, I have an unlabelled dataset of articles Reddit-Data-Mining-Pushshift-Notebook This is a notebook that shows how to extract and analyse different parts of reddit threads and comments using Pushshift API. Explore datasets powering machine learning. zst: All Reddit submissions that were posted during April 2019. Pushshift. io Reddit API was designed and created by the /r/datasets mod team to help provide enhanced functionality and search capabilities Hi, Do you have a copy of the dataset along with the parent_id column When using the Pushshift API for scientific study, it is very important to use the metadata parameter to check a few values The Pushshift API will sometimes return incomplete results if shards fail or the We’re on a journey to advance and democratize artificial intelligence through open source and open science. They contain the same data as the body and selftext fields so they aren't really useful for anything the dumps are used for, but they are often fairly large Pushshift has been providing valuable services to the Reddit community for years, enabling moderators to effectively manage their subreddits, supporting research in academia (1000s of peer-reviewed Historical data torrents all in one place (including 2023-03) Contribute to amiekong/nlp-reddit-analysis development by creating an account on GitHub. Confused on How to Use Pushshift I'm new to pushshift and in general scraping posts with a Reddit API. io Reddit API was designed and created by the /r/datasets mod team to help provide enhanced functional-ity and search capabilities for searching Reddit comments and Pushshift Reddit Dataset is a comprehensive archive of Reddit posts and comments that enables large-scale analysis in the post-API era. single_file. io Reddit API was designed and created by the /r/datasets mod team to help provide enhanced functionality and search capabilities for searching Reddit comments and submissions. com reddit archived We’re on a journey to advance and democratize artificial intelligence through open source and open science. Date in CU Experts January 31, 2021 1:36 AM Documentation and tools for the Arctic Shift project. io Reddit API was designed and created by the /r/datasets mod team to help provide enhanced functionality and search capabilities for searching Reddit comments and Kaggle uses cookies from Google to deliver and enhance the quality of its services and to analyze traffic. py decompresses and iterates over a single zst pushshift-reddit like 0 Dataset card FilesFiles and versions Community Dataset Preview Auto-converted to Parquet API Subset default (10. Normally PRAW (Reddit Python We’re on a journey to advance and democratize artificial intelligence through open source and open science. Pushshift is a social media data collection, analysis, and archiving platform that since 2015 has collected Reddit data and made it available to Pushshift is a free resource and can be used to collect data from Reddit, which is updated in real-time, but it also includes historical data, dating back to Reddit's inception. These are zstandard compressed ndjson files. Stores everything in a Supabase database that you control Handles Pushshift Reddit API v4. parquet ff199a5 2 pushshift-reddit like 0 Dataset card FilesFiles and versions Community Dataset Viewer (First 5GB) Auto-converted to Parquet API Go to dataset viewer Viewer Subset default (10. io Extracting and Processing Reddit datasets from PushShift There are many ways to access the rich data available in Reddit. Access Pushshift API's Swagger UI documentation to explore methods for querying and retrieving Reddit data effectively. 7M rows) Split train (10. Pushshift will serve as the index of posts and How to extract and analyse different parts of Reddit Threads, Submissions and Comments with Pushshift's API. Pushshift also includes several AI Quick Summary The Pushshift Reddit dataset offers a comprehensive, real-time collection of Reddit data, including historical data from Reddit's inception, to facilitate social media 文章浏览阅读1. Nodes are Reddit users Here's what RedditHarbor does: Connects directly to Reddit API and downloads submissions, comments, user profiles etc. pushshift-reddit like 0 Dataset card FilesFiles and versions Community Dataset Viewer Auto-converted to Parquet API fddemarco--pushshift-reddit (122M rows) Split train (122M rows) Submit Downloading large amounts of Reddit data I really want to start learning more NPL. io创建的,自2015年以来收集并提供给研究人员的Reddit数据集。 该数据集实时更新,包含Reddit自成立以来的历史数据。 除了每月的数据转储 We’re on a journey to advance and democratize artificial intelligence through open source and open science. io API简介 Pushshift. I'm a novice in NLP in general and would love to get some advice on how to do this. io and to then extract the comments for a particular The PushShift Reddit dataset, which makes entire dumps of Reddit available on a regular schedule, is also made available without a license (to our knowledge). Pushshift is a social media data collection, analysis, and archiving platform that since 2015 has collected Reddit data and made it We’re on a journey to advance and democratize artificial intelligence through open source and open science. 7M Pushshift Archive ~ 2005-06 to 2023-03 Pushshift was a social media data collection, analysis, and archiving platform that since 2015 collected Reddit data Welcome! This repository explores the Pushshift Reddit Dataset, one of the most comprehensive, large-scale datasets available for analyzing online discourse, community behavior, and social trends on Contribute to Evan-Jiamg/Reddit-Dataset development by creating an account on GitHub. We take no responsibility for and we do These datasets are used in machine learning (ML) research and have been cited in peer-reviewed academic journals. 99TB. Click on your The Pushshift Reddit dataset makes it possible for social media researchers to reduce time spent in the data collection, cleaning, and storage phases of their TL;DR: Pushshift is in violation of our Data API Terms and has been unresponsive despite multiple outreach attempts on multiple platforms, and has not addressed pushshift-reddit like 0 Dataset card FilesFiles and versions Community main pushshift-reddit 1 contributor History:268 commits fddemarco Upload RS_2018-02_00. Pushshift is a social media data collection, analysis, and archiving platform that since 2015 has collected Reddit data and made it available to We’re on a journey to advance and democratize artificial intelligence through open source and open science. Example python scripts for parsing the data can be found here If Adding a Dataset Name: Reddit comments (2015-2018) Description: Reddit is an American social news aggregation website, where users can post links, and take part in discussions on these The Pushshift Reddit Dataset Jason Baumgartner1,*, Savvas Zannettou2, , Brian Keegan3, Megan Squire4, Jeremy Blackburn5, , , 1Pushshift. Example python scripts for parsing the data can be found here If These are from the pushshift dumps from 2005-06 to 2024-12 which can be found here These are zstandard compressed ndjson files. The In addition to monthly dumps, Pushshift provides computational tools to aid in searching, aggregating, and performing exploratory analysis on the entirety of the dataset. Announcing PullPush, a successor and further development of Pushshift. Social media data Presentation of the peer-reviewed paper:Jason Baumgartner, Savvas Zannettou, Brian Keegan, Megan Squire, Jeremy Blackburn. I will Pushshift's Reddit dataset is updated in real-time, and includes historical data back to Reddit's inception. parquet ce05aed about 1 month Step 2: Install HuggingFace libraries: Open a terminal or command prompt and run the following command to install the HuggingFace libraries: Pushshift is not perfect, just like everything else in this universe For one thing, there are a couple days delay on the Pushshift dataset — meaning that the latest Reddit data you can grab Explore how to use Hugging Face's RoBERTa model for sentiment analysis on Reddit posts to gain insights from online discussions! How to Use Pushshift with the Official Reddit API Use PSAW (installed earlier) to query Pushshift and get back reddit API PRAW objects. Datasets are an integral part of the field of machine learning. In this paper, we present the Pushshift Reddit dataset. 3 Pushshift - Reddit API The Pushshift Reddit API, offers expansive access to Reddit’s historical data, bypassing the latter’s limitations on data recency and query volume. Pushshift: Is a social media data collection, analysis, and archiving platform that has collected Reddit data and made it available to researchers. This article surveys research works in the quickly advancing field of instruction tuning (IT), a crucial technique to enhance the capabilities and The Pushshift Reddit Dataset We provide a small sample of the Pushshift Reddit dataset. I'm looking to scrape some Reddit posts for a personal research project and have heard secondhand The following codes will not work sooner or later. io/reddit/submissions/ Yeah, sorry, it's half a terabyte of data and you have to We’re on a journey to advance and democratize artificial intelligence through open source and open science. The pushshift. The In this paper, we present the Pushshift Reddit dataset. - wlgfour/reddit_scraper About Dataset This is a scrape of Reddit posts obtained via the BigQuery PushShift API These are from the pushshift dumps from 2005-06 to 2023-12 which can be found here These are zstandard compressed ndjson files. io is only provided to subreddit moderators Explore datasets powering machine learning. Auto-converted to Parquet API Embed Full Screen Viewer TL;DR: Pushshift is in violation of our Data API Terms and has been unresponsive despite multiple outreach attempts on multiple platforms, and has not addressed Remove the body_html and selftext_html fields. The Pushshift Reddit dataset makes it possible for social media researchers to reduce time spent in the data collection, cleaning, and storage phases of their projects. Big thank you to FlyingPackets for providing that data. pushshift. Pushshift is a social media data collection, analysis, and archiving platform that since 2015 has collected Reddit data and made it available to Pushshift Reddit Dataset – r/AskHistorians Hey everyone (: So my PhD mentor and I have been working with all comments and submissions from r/AskHistorians, since the beginning of the subreddit (2011). io Reddit API was designed and created by the /r/datasets mod team to help provide enhanced functionality and search capabilities It provides a small sample of the Pushshift Reddit dataset. org In addition to monthly dumps, Pushshift provides computational tools to aid in searching, aggregating, and performing exploratory analysis on the entirety of the dataset. There isn't really "one place" for data that I know of. This paper details the Pushshift platform's technical infrastructure and extensive Reddit dataset that advances social media research. py decompresses and iterates over a single zst 🤗 Datasets is a lightweight library providing two main features: one-line dataloaders for many public datasets: one-liners to download and pre-process any of the major public datasets (in 467 languages [Question] Anybody know of a PushShift dataset mirror? Given the principle of 3-2-1, I was wondering if there was a mirror or not for https://files. A 3rd party service to keep 3rd party apps running. Pushshift is a social media data collection, analysis, and archiving platform that since 2015 has collected Reddit data and made it For this reason, I have to download the complete dataset titled "Reddit comments/submissions 2005-06 to 2022-12," which amounts to 1. TL;DR: Pushshift as mentioned in this paper is a social media data collection, analysis, and archiving platform that since 2015 has collected Reddit data and made it available to researchers Pushshift's Pushshift is a social media data collection, analysis, and archiving platform that since 2015 has collected Reddit data and made it available to researchers. About Dataset This is a scrape of Reddit posts obtained via the BigQuery PushShift API In this paper, we present the Pushshift Reddit dataset. An alternative to PRAW. About Tools for downloading, decompressing, and processing Reddit data from the Pushshift API into a MySQL database. We’re on a journey to advance and democratize artificial intelligence through open source and open science. Reddit Political Discourse Dataset Data Source Pushshift Archive: Pushshift is a social media data collection, analysis, and archiving platform that has collected About Making Reddit data accessible to researchers, moderators and everyone else. Preface The pushshift. Pushshift is a social media data collection, analysis, and archiving platform that since 2015 has collected Reddit data and made it available to Datensatz DATENSATZ AKTIONEN EXPORT EndNote (UTF-8) BibTeX JSON eSciDoc XML MarcXML pdf docx (MS Word, Open Office) html (unformatiert) html (verlinkt) JSON Snippet eSciDoc Snippet Reddit comments and submissions from 2005-06 to 2023-09 collected by pushshift and u/RaiderBDev. Each line is a JSON object of the following format: In addition to monthly dumps, Pushshift provides computational tools to aid in searching, aggregating, and performing exploratory analysis on the entirety of the dataset. How to Scrap Reddit using pushshift. There are over four billion comments and submissions available via the Currently, data is copied into Pushshift at the time it is posted to reddit. You could scrape, or you could use the data that has been kindly made available Without direct database access, suggest you use the Pushshift submission dumps https://files. io via Python In early 2018, Reddit made some tweaks to their API that closed a previous method for pulling an entire Subreddit. I define “large” as a set of The pushshift. Initial pushshift-reddit-comments like 1 Dataset card FilesFiles and versions Community Dataset Viewer Auto-converted to Parquet API Subset default (1. In addition to monthly dumps, Pushshift provides computational tools to aid in searching, aggregat-ing, and performing exploratory analysis on the entirety of the dataset. io/ I know there's a set on BigQuery, but I'm also Create a repository ¶ A repository hosts all your dataset files, including the revision history, making it possible to store more than one dataset version. For anyone not familiar, these are the old pushshift dump files published by Stuck_In_the_Matrix through March 2023, then the rest of the year published by u/raiderbdev. Pushshift's Reddit dataset is updated in real-time, In addition to monthly dumps, Pushshift provides computational tools to aid in searching, aggregating, and performing exploratory analysis on the entirety of the dataset. 7M rows) Submit Separate dump files for the top 40k subreddits, through the end of 2023 Since the API changes last year, is there any way to access Reddit data for academic research? Pushshift. The Pushshift Reddit dataset In this paper, we present the Pushshift Reddit dataset. The Pushshift Reddit dataset Source Data The Reddit PushShift data dumps are part of a data collection effort which crawls Reddit at regular intervals, to extract and keep all its data. I’m planning to upload around 50GB of CSV files to my huggingface dataset and I wonder what’s the proper to push them? Should we use push_to_hub, or git lfs? and what’s the proper way Pushshift mainly separates the data into 2 broad endpoints, comments and submissions. In this article, I’m going to show you how to use Pushshift to scrape a large amount of Reddit data and create a dataset. Example python scripts This repo contains example python scripts for processing the reddit dump files created by pushshift. The files can be torrented from here. 该数据集包含50个高质量Reddit子版块的提交内容,这些内容是从Reddit PushShift数据转储中提取的(时间跨度为2006年至2023年1月)。数据集的结构包括多个子版块的分割,每个分割对 Pushshift is a free resource and can be used to collect data from Reddit, which is updated in real-time, but it also includes historical data, dating back to Reddit's inception. io and the Reddit API. Therefore, scores and other meta such as edits to a submission's selftext or a comment's body field may not reflect what is pushshift-reddit like 0 Dataset card FilesFiles and versions Community Dataset Viewer (First 5GB) Auto-converted to Parquet API Go to dataset viewer Viewer Subset default (10. Search or download archived reddit data. arctic-shift. Original content by Reddit users. The sample consists of two files: RS_2019-04. Pushshift is a social media data collection, analysis, and archiving platform that since 2015 has collected Reddit data and made it available to Reddit评论数据集包含了50个高质量子论坛的评论,数据来源于Reddit PushShift数据转储(2006年至2023年1月)。该数据集支持文本生成、语言建模和对话建模等任务。每个数据分割对应 In addition to monthly dumps, Pushshift provides computational tools to aid in searching, aggregating, and performing exploratory analysis on the entirety of the dataset. That Make Your First Reddit API Call (Easy Way) To call the Reddit API and extract the data, we will use an API called Pushshift. I've been converting the zst compressed ndjson files into a Pushshift is a social media data collection, analysis, and archiving platform that since 2015 has collected Reddit data and made it available to researchers. All URLs used to request from the database with begin by specifying either a comment or submission Reddit (Title, Body)-Pairs This dataset contains jsonl-Files about (title, body) pairs from Reddit. 4k次,点赞4次,收藏7次。探索Pushshift Reddit API:解锁Reddit数据的无限可能在互联网的信息海洋中,Reddit是一个无尽的知识宝库,涵盖各种主题的讨论和分享。为了 The parquet-converter bot has created a version of this dataset in the Parquet format in the refs/convert/parquet branch. Pushshift is a social media data collection, analysis, and archiving platform that since 2015 Contribute to quannh08/Massive-dataset-mining development by creating an account on GitHub. The huggingface dataset implementation would freeze for hours scanning the image folder, and trying to modify the behaviour of the dataset class was a huge headache. Pushshift also includes several Dataset Card for Reddit threads Dataset Summary The Reddit threads dataset contains 'discussion and non-discussion based threads from Reddit which we collected in May 2018. Why Pushshift API over the Reddit official API (PRAW)? The Reddit API (PRAW) provides access Pushshift is a data collection and analysis platform that specializes in archiving and indexing social media data for research purposes. Major advances in I’m planning to upload around 50GB of CSV files to my huggingface dataset and I wonder what’s the proper to push them? Should we use push_to_hub, or git lfs? and what’s the proper way We would like to show you a description here but the site won’t allow us. The project is divided into two main phases: Data We’re on a journey to advance and democratize artificial intelligence through open source and open science. 85B rows) Join the discussion on this paper page Welcome! This repository explores the Pushshift Reddit Dataset, one of the most comprehensive, large-scale datasets available for analyzing online discourse, community behavior, and social trends on The Pushshift Reddit dataset makes it possible for social media researchers to reduce time spent in the data collection, cleaning, and storage pushshift-reddit-comments like 0 Dataset card FilesFiles and versions Community main pushshift-reddit-comments /data 1 contributor History:276 commits fddemarco Upload RC_2016-02. In this comprehensive guide, we’ll This is a very basic R package for fetching Reddit data using the pushshift API. In addition to monthly dumps, Pushshift provides computational tools to aid in searching, aggregating, and performing exploratory analysis on the entirety of the dataset. 4. io Reddit API was designed and created by the /r/datasets mod team to help provide enhanced functionality and search capabilities for searching We’re on a journey to advance and democratize artificial intelligence through open source and open science. Install Learn how to overcome the limitations of Reddit's API by utilizing Pushshift and the PRAW package for efficient and comprehensive data retrieval. The Pushshift Reddit I'm going to miss pushshift, their service was valuable for catching reddit moderators performing underhanded censorship of posts they didn't agree with. 7M Currently, data is copied into Pushshift at the time it is posted to reddit. Pushshift Reddit Dataset是由Pushshift. Usually it comes coupled with something. It is particularly known for its extensive collection of Reddit data. Uploading your dataset to In addition to the raw data, we also provide the source code used to collect it, allowing researchers to run their own data collection instance. At present, the package should suit general users, but is not a general package. Therefore, scores and other meta such as edits to a submission's selftext or a comment's body field may not reflect what is Pushshift Reddit API v4. TERMS OF USE By utilizing Pushshift to access any Reddit, Inc. Their thoughtful and careful examination highlighted the fact that 数据集介绍 简介 Pushshift 提供了 2005 年 6 月至 2019 年 4 月期间在 Reddit 上发布的所有提交和评论。该数据集包含 651,778,198 条提交和发布在 2,888,885 个子版块上的 5,601,331,385 条评论。 引文 Source & License Repackaged from Arctic Shift monthly dumps, which re-process the PushShift Reddit archive. io. pushshift-reddit-comments like 15 Modalities: Tabular Text Formats: parquet Size: 1B - 10B Libraries: Datasets Dask Croissant + 1 Dataset card Data Studio FilesFiles and versions Community 1 main The Pushshift Reddit dataset offers comprehensive Reddit data for researchers, updated in real-time and including historical data since its inception. io API 是一个强大的工具,它使得开发者能够轻松访问和利用来自Reddit平台的庞大数据资源。 作为数据挖掘和 This package is intended to assist with downloading, extracting, and distilling the monthly reddit data dumps made available through pushshift. (“Reddit”) data or data API (the “Reddit Data API”), user certifies that they are a registered user of Reddit and a Reddit moderator (a “Mod") Need help to make the dataset viewer work? Make sure to review how to configure the dataset viewer, and open a discussion for direct support. Pushshift is a social media data collection, analysis, and archiving platform that since 2015 has collected Reddit data and made it Bibliographic details on The Pushshift Reddit Dataset. zst: All Reddit submissions that were posted during In this paper, we present the Pushshift Reddit dataset. Explore the Pushshift Reddit Dataset, a comprehensive archive designed to overcome API limitations and power reproducible social media research. The 🤗 Datasets is a lightweight library providing two main features: one-line dataloaders for many public datasets: one-liners to download and pre-process any of the Reddit Dataset Update Recently, Gaffney and Matias shared their findings regarding missing data in the pushshift. 85B rows) Split train (1. Pushshift is a social media data collection, analysis, and archiving platform that since 2015 has collected Reddit data and made it Pushshift Reddit Dataset是由Pushshift. The easiest way to use the API is Pushshift Reddit Dataset is a comprehensive archive of Reddit posts and comments that enables large-scale analysis in the post-API era. 🤗 Datasets is a lightweight library providing two main features: one-line dataloaders for many public datasets: one-liners to download and pre-process any of the major public datasets (image datasets, This repository contains the source code for an end-to-end Big Data pipeline and data mining project focusing on the massive Reddit Pushshift Dataset. We believe the Pushshift Telegram dataset can We would like to show you a description here but the site won’t allow us. In addition to monthly dumps, With this API, you can quickly find the data that you are interested in and discover interesting correlations within the data. 53uqxiw bhu ymk52v a9eelg u7bep cx5mf7 lxg7z imhx6 er5 4fmv