Hello! I have a dataset of around 2 million tweets.
I'm interested in deleting duplicate tweets, that is, tweets with identical content, because they are likely bot-produced.
Can anyone help me do that?
Check out this article; it might help:
Removing Duplicate Data with LOD Calculations | Tableau Software
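If you can preprocess the data before loading it, here is a rough sketch of exact-match deduplication in plain Python. It is not the LOD approach from the article. It assumes each tweet is a dict with a `text` field; that field name, the `drop_duplicate_tweets` helper, and the sample rows are made up for illustration. It keeps the first copy of each tweet and drops later ones, and it only catches tweets whose text is character-for-character identical.

```python
def drop_duplicate_tweets(tweets, key="text"):
    """Keep the first tweet for each distinct text; drop later exact copies."""
    seen = set()      # texts already encountered
    unique = []       # tweets kept, in original order
    for tweet in tweets:
        content = tweet[key]
        if content not in seen:
            seen.add(content)
            unique.append(tweet)
    return unique


# Hypothetical sample data, only to show the call.
tweets = [
    {"id": 1, "text": "Win a free phone now"},
    {"id": 2, "text": "Lovely weather today"},
    {"id": 3, "text": "Win a free phone now"},
]
unique_tweets = drop_duplicate_tweets(tweets)
print([t["id"] for t in unique_tweets])
```

A set lookup per tweet keeps this to a single pass, which should be fine for around 2 million rows. If you would rather remove every copy of a repeated tweet, including the first, you would need to count occurrences first and keep only texts that appear once.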