Skip to content

Latest commit

 

History

History
203 lines (183 loc) · 10.1 KB

CHANGELOG.md

File metadata and controls

203 lines (183 loc) · 10.1 KB

Change Log

All notable changes to this project will be documented in this file.

Unreleased

Added

  • PNDA-3330: Add default application user configuration to the deployment manager.
  • PNDA-2389: PNDA automatically reboots instances that need rebooting following kernel updates
  • PNDA-2982: Added support for adding pyspark application dependencies
  • PNDA-1960: Make Kafkat available on nodes as option for Kafka management at CLI
  • PNDA-2445: Support for Hortonworks HDP hadoop distro
  • PNDA-2163: Support for OpenTSDB Platform testing
  • PNDA-2832: Added sql magic support for Jupyter notebooks
  • PNDA-1788: Cloudera version can be set in the salt pillar
  • PNDA-3314: Added new flavor for larger PNDAs called "production"

Changed

  • PNDA-2965: Rename cloudera_* role grains to hadoop_*
  • PNDA-3216: Uprev to logstash 5.2.2
  • PNDA-3180: Limit orchestrate commands to new nodes only
  • PNDA-3212: Link logstash install directory using salt file.symlink command as the cmd.run version was preventing logshipper/logserver upgrades
  • PNDA-3249: Upgrade Kafka version to 0.11.0.0
  • PNDA-3264: Use redis 3.2.10 on redhat
  • PNDA-2884: Upgrade CDH and Cloudera Manager version 5.12.1
  • PNDA-3380: Move opentsdb log to /var/log/pnda
  • PNDA-3441: Cleanup warnings from create_notebook_dir.sh script

Fixed

  • PNDA-3213: fix issue on wrong checksum file name for logserver sls
  • PNDA-3615: conda command now works 'out-of-the-box' with correct PATH additions
  • PNDA-3216: Use new logstash plugin mechanism in 5.2.2 that actually works when offline
  • PNDA-3111: Report failures up if opentsdb.hbase_tables fails
  • PNDA-3309: use local gem installation for Kafka tool
  • PNDA-3343: When expanding a cluster new datanodes are given a spark gateway role
  • PNDA-3309: Write CM_SETUP_SUCCESS into a fixed directory
  • PNDA-3369: fix issue on offsets topic replication factor on kafka configuration zhere default value is 3
  • PNDA-3238: Add jupyter extensions to the kenel virtual environment.
  • PNDA-3350: Fix dm.pem permission post deployment highstate.
  • PNDA-3432: Jupyter not launching after reboot on RHEL.

[2.0.0] 2017-05-23

Added

  • PNDA-2375: Isolate PNDA from breaking dependency changes
  • PNDA-2456: Support for Redhat 7
  • PNDA-2480: Added a per flavor pillar setting for kafka log retention (log.retention.bytes) set to 300MB (pico) 1GB (standard) to stop disks filling up on pico clusters.
  • PNDA-2682: review console backend deployment
  • Add a simple jupyter notebook
  • Allow salt mine for all interfaces

Changed

  • PNDA-2446: Download java with wget
  • PNDA-2517: If Cloudera setup (cm_setup.py) fails, orchestrate can be re-run and cm_setup.py will attempt to continue from where it completed up to last time. Progress is recorded in /root/.CM_SETUP_SUCCESS which can be edited if manual control is required over the point to continue from.
  • PNDA-2577: Use spur 0.3.20 for cm_setup.py
  • PNDA-2596: Stop ingesting internal PNDA testbot topic
  • PNDA-2672: Explicitly set CM API version number
  • PNDA-2679: Set virtual env for impala-wrapper
  • PNDA-2691: Install nodejs/npm from deb/rhel packages
  • PNDA-2717: Remove pypi default URL
  • PNDA-2721: Add spark gateway roles to datanodes
  • PNDA-2756: Move Cloudera Manager installation in orchestrate stage instead of highstate stage
  • PNDA-2758: Add a wait on elasticsearch running for kibana-dashboard
  • PNDA-2787: Write cm_setup.log to /var/log/pnda instead of /tmp
  • PNDA-2808: Install PNDA platform-libraries on all CDH nodes instead of just the jupyter node.
  • PNDA-2810: Update boto library to 2.46.1 required to work with certain AWS regions (e.g. London)
  • PNDA-2817: Remove cloudera-keys sls
  • PNDA-2820: Refactoring of the installation of graphite-api
  • PNDA-2883: add auth_version to pr-config.json to set the swift keystone auth version associated with auth_url
  • PNDA-2885: Add gcc dependency for package-repository
  • PNDA-2903: Install node.js from tar.gz instead of deb package
  • PNDA-2966: Replace separate install_sharedlib.py with function in cm_setup.py
  • PNDA-2964: Stop using ec2 grains during deployment as it's not needed anymore
  • PNDA-2984: Upgrade JDK to 8u131
  • PNDA-2881: Update Kafka Manager version to 1.3.3.6
  • PNDA-2839: Update Grafana version to 4.2.0. Warning: the default pnda password has changed.
  • PNDA-2841: Update Logstash version to 5.0.2 for PNDA logshipper/logserver
  • PNDA-2838: Update OpenTSDB to version 2.3.0
  • PNDA-3085: Set timezone to UTC (UTC by default but can be configured with ntp:timezone pillar)
  • PNDA-3114: Install CDH platform testing modules after CDH has been set up.
  • Update versions of cloudera manager and redis
  • Add a flavor parameter to change kafka/zookeeper listening interface
  • Install cloudera manager agents manually
  • Explicitly set API version for CM

Fixed

  • PNDA-2710: Remove online URL for logstash
  • PNDA-2781 Fixes for redhat mirror usage
  • PNDA-2874: Install correct snappy compression libraries, so avro files can be viewed in HUE again
  • PNDA-3059: Use latest version of numerous base packages from distro
  • PNDA-3112: Multiline log messages from file input
  • PNDA-3129: Create log directory for gobblin which was missing and preventing log from being written.
  • Update console-frontend owner to allow nginx to read files
  • Update Elasticsearch/Kibana extraction to fix permission issues

[1.3.0] 2017-01-20

Added

  • PNDA-2533: Ability to create an ElasticSearch cluster for external usage
  • A new motd when login to nodes via SSH

Changed

  • PNDA-2121: Improve jupyter sls files
  • PNDA-2239: The 'pnda' user and 'pnda' group are able to write/delete files written by Gobblin which now runs as 'pnda' user instead of 'gobblin' user
  • PNDA-2467: Put platform-data-mgmt in a virtualenv
  • PNDA-2468: Put platform-deployment manager in a virtualenv
  • PNDA-2469: Put platform-package-repository in a virtualenv
  • PNDA-2484: Put hdfs-cleaner in a virtualenv
  • PNDA-2485: Pin all versions of pnda python components and upgrade the version of libraries
  • PNDA-2489: Remove _modules/zk.py file and move it to _modules/pnda.py and do simple REST queries instead of using cm-api
  • PNDA-2542: Put elastic-search-curator in a virtualenv
  • PNDA-2544: Download Zookeeper from top level mirror instead of US mirror
  • PNDA-2547: Put impala-shell in a virtualenv
  • PNDA-2550: kibana dashboard are now imported without elasticdump, which simplifies the installation
  • PNDA-2551: Put Jupyter python components in a virtualenv
  • PNDA-2552: Put graphite of the console in a virtualenv
  • PNDA-2560: Put cm_setup cloudera installation script in a virtualenv
  • PNDA-2598: Put pnda_restart script in a virtualenv

Fixed

  • PNDA-2494: Concurrency issue during deployment related to master dataset creation
  • PNDA-2498: Deployment-manager was passing the wrong thrift server to the Happybase library
  • PNDA-2511: Pin version of nodejs to avoid installation failure of kibana
  • PNDA-2543: Make the creation of the cloudera manager external database idempotent

[1.2.0] 2016-12-12

Added

  • PNDA-2250: Provide tools that allow pnda infra to be rebooted and the services started again
  • Aggregating redundant services into common VMs (anti-affinity on OpenStack)

Changed

  • PNDA-2284: hdfs-cleaner creates the archive container it needs
  • PNDA-1918: Simplify component paths
  • PNDA-2392: Refactor hue user creation
  • Update CDH to 5.9.0
  • PNDA-1812: Add a re-apply config mode to cm_setup
  • Merge general zookeeper/kafka white and black box tests
  • PNDA-2487: Increase HBase heaps for CDH5.9
  • PNDA-2496: CM now uses external MySQL database instead of embedded postgres db

Fixed

  • PNDA-2231: Don't fail if the pnda user is already created in grafana
  • Change 'heap_dump_directory_free_space' warnings for PICO flavor
  • Update logshipping for gobblin
  • PNDA-2431: Reduce impala catalog server heap size for pico
  • Fix UTF8 issue on master-dataset
  • PNDA-2434: Pin version of ES curator and specify full path
  • PNDA-2435: Reduce ES data retention for pico
  • Create a python virtualenv for platform testing
  • PNDA-2488: Harmonize heap dump warning and reduce firehose size
  • Fix issue on DM config as OpenTSDB configuration should be IP:PORT not a link
  • Alter YARN parameters to give 512MB map tasks
  • PNDA-2487: Adjust hbase, yarn and mapred to better fit pico

[1.0.1] 2016-10-31

Fixed

  • PNDA-2368: Include console-backend 0.2.2 to fix version of the redis-parser npm module to 2.0.4

[1.0.0] 2016-10-21

Added

  • Multi-flavor mechanism, with pico flavor
  • PNDA-2320 Kafka manager port is now in pillar
  • PNDA-2272 review formulas in order to ensure no issue on deployment even if there is not all roles
  • PNDA-2233 Jupyter notebook plugin added to deployment manager

Fixed

  • Some logs were not in /var/log/pnda and so, were not shipped to the logserver
  • The 'bulk' directory in HDFS is now owned by the 'pnda' user
  • Prevent Gobblin from ingesting internal kafka __consumer_offsets topic

Changed

  • ntp:servers pillar default was removed, but can still be set

[0.2.0] 2016-09-07

Added

  • Install Jupyter-Spark extension
  • Add default Grafana dashboards for PNDA metrics
  • PNDA-2010 Create Graphite datasource
  • Add a simple minimal notebook to explain basic Jupyter usage
  • Create PNDA OpenTSDB default datasource in Grafana
  • PNDA-820 Added graphite formula to salt

Fixed

  • PNDA-2022 Re-raise exceptions from cm_setup for fail fast behaviour
  • PNDA-2009 Tell upstart to not log console output of data-logger
  • PNDA-1933 CM setup waits on cloudera manager being available
  • Do not use http download for cloudera manager
  • Clean ups to Grafana states
  • Clean ups to OpenTSDB states
  • Clean ups to Jupyter states

Changed

  • PNDA-2012 Update to Grafana 3.1.1
  • PNDA-1485 Update OpenTSDB to 2.2.0
  • PNDA-2016 Update Zookeeper to 3.4.6
  • Update Kafka Manager to 1.3.1.6
  • Update Kafka to 0.10.0.1
  • Use saltenv instead of env in anticipation of upcoming Salt Boron
  • Automatically set test topic replication based on broker cluster size
  • Make the pnda user home directory configurable
  • For AWS, launch PNDA on private network
  • PNDA-1923 Add the yarn daemon and application logs to logserver
  • PNDA-2005 Add a state that creates a PNDA test topic on Kafka instead of using KM

[0.1.0] 2016-07-01

First version

  • Creates and configures all PNDA services