Tag: Parquet

  • Spark, Cassandra and Python

    In this post we touch briefly on Apache Spark as a cluster computing framework that supports a number of drivers to pipe data in, and…

    Read
  • Intro to Big Data Projects

    Modern applications produce very large datasets, beyond what traditional data-processing applications can handle. Big data is a discipline that specializes in processing such data. For…

    Read