Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how to run ? #76

Open
Saurhub69 opened this issue Dec 16, 2021 · 2 comments
Open

how to run ? #76

Saurhub69 opened this issue Dec 16, 2021 · 2 comments

Comments

@Saurhub69
Copy link

i don't want to run this code on google cloud i just want it till ("Extract the data and split it into shards")
but i don't know how to do it can someone explain me how to run this commands

this one

PROJECT="your-google-cloud-project"

DATADIR="gs://${BUCKET?}/opensubtitles/$(date +"%Y%m%d")"

python opensubtitles/create_data.py
--output_dir ${DATADIR?}
--sentence_files gs://${BUCKET?}/opensubtitles/raw/lines/lines-*
--runner DataflowRunner
--temp_location ${DATADIR?}/temp
--staging_location ${DATADIR?}/staging
--project ${PROJECT?}
--dataset_format TF

@duongkstn
Copy link

create account in google cloud,
create a project and enable bigquery, google cloud storage.
These commands should be ran on Google clould console

@Saurhub69
Copy link
Author

hello dyoungkstn i am asking i want to run this whole project on vs code (on my local machine ) but things showing error
at first it was showing tensorflow error ,then apache baem and right now assertion error can you tell me how to run this project locally

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants