Solid Hadoop platform knowledge, including Spark, Mapreduce, Hive and HBASE.
Solid knowledge of Unix / Linux operating systems;
Solid knowledge in hardware design, network architectures, TCP / IP, DNS, Firewalls;
Solid knowledge in Shell and SQL languages;
Solid knowledge in Java, Python, or Scala (more than one is desirable);
Solid knowledge in Data Modeling, Data Warehousing and MPP databases (Greenplum, Teradata, Netezza);
Solid conceptual knowledge of NoSQL and practical in some solution (Hbase, Cassandra, Riak, Redis, MongoDB, etc).
o Desirable qualifications:
Authentication mechanisms such as Kerberos and OpenLDAP;
Automation tools like Chef, Puppet, Ansible.
ETL, BI and Visualization tools (Pentaho, Cognos, Powercenter, SAS, Microstrategy, Tableau).
Integration and messaging architectures;
Development of web applications, backends and REST APIs.
Development pipelines and DevOps
#design #architecture #devops #java #python #integration #hadoop