-
Notifications
You must be signed in to change notification settings - Fork 572
Tpetra FY18 Plan
DELIVERY: FY18 Q1; POC: Devine
Identification of performance, maintainability, quality issues; feasibility analysis; priority setting
DELIVERY: FY18 Q2; POC: Siefert
Integration of KokkosKernels SpGEMM on GPUs; performance evaluation
MPI+X improvements, including algorithmic improvements ported from Epetra
DELIVERY: FY18 Q2 -- identification, repository, configure, build; FY18 Q4 -- automated testing; POC: Hu
- MueLu setup - use MueLu driver
- CG - use benchmark in Tpetra
- Assembly -- best practice code
- Import / Export
- Overlap - Ifpack2 Additive-Schwarz
- BlockCRS
DELIVERY: FY18 Q2; POC: Trott
DELIVERY: Evaluation in FY18 Q2; POC: Luchini
Benchmark MueLu vs ML set-up; identify potential benefit and semantic changes
DELIVERY: FY18 Q3; POC: Hoemmen
DELIVERY: FY18 Q3 for development and deprecation; FY19 Q1 for removal of deprecated code; POC: Trott
- Includes performance improvements for Matrix / Graph construction and fill
- Includes Deprecation of Dynamic Profile (insert operation that can fail, examples of new application usage, utilities for reallocation, support for knowing max # entries but not max # entries per row)
- Includes reducing unsafe data access by returning of unmanaged Kokkos::View
DELIVERY: FY18 Q3 for development and deprecation; FY19 Q1 for removal of deprecated code; POC: Hoemmen
- Fix GlobalOrdinal to be int64_t
- Allow compile-time option for LocalOrdinal: int32_t or int64_t
- Determine node type from compile-time option rather than template parameter
- Includes defining offset_type to be Kokkos::size_type
DELIVERY: FY18 Q4; POC: Fuller
- Distributor / Import / Export design and performance
- Use KokkosKernels interfaces to on-node TPLs
- Create Tpetra::initialize, Tpetra::finalize to enable correct handling of Kokkos, MPI
- Matrix-Matrix multiplication: enable separate symbolic / numeric usage
- Remove local solves from Tpetra
- Consolidate and make consistent use of arithmetic and communication Traits
- Remove CUDA_LAUNCH_BLOCKING=1 requirement
- Overlapping computation and communication
- Separation of concerns: e.g., Comm: Teuchos::Comm, iallreduce in Tpetra
- Feature testing of ROL- and Sacado-like data types
- Remove Tpetra::Map::getIndexBase() method
- Dual view semantics
- Support for limited graph changes
January 31, 2018
SAND2018-1063 O
Copyright © Trilinos a Series of LF Projects, LLC
For web site terms of use, trademark policy and other project policies please see https://lfprojects.org.
Trilinos Developer Home
Trilinos Package Owners
Policies
New Developers
Trilinos PR/CR
Productivity++
Support Policy
Test Dashboard Policy
Testing Policy
Managing Issues
New Issue Quick Ref
Handling Stale Issues and Pull Requests
Release Notes
Software Quality Plan
Compiler Warnings/Errors
Proposing a New Package
Guidance on Copyrights and Licenses
Tools
CMake
Doxygen
git
GitHub Notifications
Mail lists
Clang-format
Version Control
Initial git setup
'feature'/'develop'/'master' (cheatsheet)
Simple centralized workflow
Building
SEMS Dev Env
Mac OS X
ATDM Platforms
Containers
Development Tips
Automated Workflows
Testing
Test Harness
Pull Request Testing
Submitting a Pull Request
Pull Request Workflow
Reproducing PR Errors
Addressing Test Failures
Trilinos Status Table Archive
Pre-push (Checkin) Testing
Remote pull/test/push
PR Creation & Approval Guidelines for Tpetra, Ifpack2, and MueLu Developers