Dont tell me your data is bigdata till you have encountered errors like these.
On side note if you get this error or your favourite tooling is not able to handle the dataset its time to upgrade your toolset.
my general choice ranges so far are :
sheet/excel -> grep/awk/cut/sort/uniq -> rdbms -> nosql_dataset
If it can be done be sorted by a pin dont bring in the sword.