When reading a file from HDFS in chunks (using chunk_size), why does a single record get split into several?
with client.read(full_path, encoding='utf-8', chunk_size=10000) as reader:
    for piece in reader:
        piece = piece.split('\n')
        for line in piece:
            print(line)
The original record is 2018-05-01|weorjerjsfj|worjwelfjs|
but what gets read back is 2018-05-01|weo
rjerjsfj|worjwelfjs|, printed as two separate records.
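
A likely explanation: chunk_size cuts the stream by size, not at line boundaries, so a record that straddles the end of one chunk comes back half in that chunk and half in the next. Splitting each chunk on '\n' independently then prints the two halves as separate lines. One common way around this is to carry the unfinished tail of each chunk over to the next one. Below is a minimal sketch of that idea; iter_lines is a hypothetical helper name (not part of any HDFS library), and the chunks list only simulates what the reader might yield.

```python
def iter_lines(chunks, sep="\n"):
    """Yield complete lines from an iterable of text chunks.

    Hypothetical helper: `chunks` can be any iterable of str, for example
    the reader returned by client.read(..., chunk_size=...).
    """
    tail = ""  # unfinished line carried over from the previous chunk
    for chunk in chunks:
        parts = (tail + chunk).split(sep)
        tail = parts.pop()  # the last element may be an incomplete line
        for line in parts:
            yield line
    if tail:  # the file did not end with a separator
        yield tail


# Simulated chunks: the record is cut in the middle, as in the question.
chunks = ["2018-05-01|weo", "rjerjsfj|worjwelfjs|\n"]
lines = list(iter_lines(chunks))
print(lines)
```

In the original loop this would be used as `for line in iter_lines(reader): print(line)`. If client here comes from the HdfsCLI (hdfs) package, its read() also appears to accept a delimiter argument (for example delimiter='\n', with encoding set and chunk_size left out) that yields one record at a time; that is worth checking against the documentation for the installed version.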