-
Notifications
You must be signed in to change notification settings - Fork 846
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] cudf.Series.duplicated returns error 'Series' object has no attribute 'duplicated' #15777
Comments
Thanks for the report. Could you share what version of cudf you are running as well as a reproducible example? For example, this works on cudf 24.04 >>> import cudf
>>> cudf.Series(list("121")).duplicated()
0 False
1 False
2 True
dtype: bool
>>> cudf.__version__
'24.04.00' |
thank you for the reply cudf.version says '22.10.01+2.gca9a422da9' again this is the Paperspace cloud service's RAPIDS image. I will come back with a reproducible code later today. |
I do not see It appears |
thank you for the clarification |
Closing as the OP was the expected behavior given the cudf version |
Describe the bug
I am trying to see if there are duplicate values in a feature within a dataframe using duplicated()
Steps/Code to reproduce bug
first i tried using the duplicated() on the column itself
df['job_title'].duplicated()
then explicitly made it a series of string values then ran duplicated().
`dups = cudf.Series(df['job_title']).astype('string')
dups = dups.duplicated()
`
in these cases i get the error: 'Series' object has no attribute 'duplicated'
Expected behavior
Something like this, where 'True' means duplicate of something that came before:
0 False
1 False
2 True
3 False
4 True
dtype: bool
Environment overview (please complete the following information)
Environment details
Please run and paste the output of the
cudf/print_env.sh
script here, to gather any other relevant environment detailsI couldn't run the print_env.sh, wasn't found in the directory
nvidia-smi says: NVIDIA-SMI 525.116.04 Driver Version: 525.116.04 CUDA Version: 12.0
I was using a P5000 on paperspace
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: