ysn2233
V2EX  ›  Hadoop

大数据环境中压缩格式用什么比较好?

  •  
  •   ysn2233 · Mar 12, 2020 · 3828 views
    This topic created in 2277 days ago, the information mentioned may be changed or developed.

    因为文件大小可能不一,需要支持 splittable 的,目前看到的貌似有 Bzip2 和 lzo (需要建索引),哪个相对比较好用?

    1 replies    2020-03-12 14:46:44 +08:00
    alya
        1
    alya  
       Mar 12, 2020
    snappy lz4 zstd
    About   ·   Help   ·   Advertise   ·   Blog   ·   API   ·   FAQ   ·   Solana   ·   2588 Online   Highest 6679   ·     Select Language
    创意工作者们的社区
    World is powered by solitude
    VERSION: 3.9.8.5 · 31ms · UTC 11:03 · PVG 19:03 · LAX 04:03 · JFK 07:03
    ♥ Do have faith in what you're doing.