Fast Matrix Transposition

This repository contains functions with different approaches to Transposition of Matrix and performance tests of them.

We sequentially implement :

Naive matrix transposition (single-thread)
Parallel naive matrix transposition (multi-threads)
SSE matrix transposition (single-thread)
SSE Block matrix transposition (single-thread)
and fastest variant SSE Parallel Block matrix transposition (multi-threads)

Results of performance tests:

All tests were performed on "Intel(R) Core(TM) i7-8650U CPU @ 1.90GHz" with 16 GB of DDR4 RAM
As can be seen significant difference apears starting with matrix about 2000x2000

SSE Block matrix transposition approach faster than any other single-thread approaches
SSE Parallel Block matrix transposition fastest at all

How-to-start tests:

To start performance test build and run following:

FastMatrixTransposition [matrix_size] [block_size] [number_of_threads] [number_of_tests performance tests for each approach]

Will outputed average times for each approah

Example of output:

8000,194659.156250,97701.046875,140561.281250,86731.703125,62631.093750

Format of output:

[matrix_size],[naive approach], [parallel naive approach], [SSE matrix transposition], [Block transposition], [Block SSE parallel transposition]

All times in nano seconds

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
img		img
python		python
.gitignore		.gitignore
FastMatrixTransposition.cbp		FastMatrixTransposition.cbp
FastMatrixTransposition.depend		FastMatrixTransposition.depend
FastMatrixTransposition.layout		FastMatrixTransposition.layout
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
fmt.cpp		fmt.cpp
fmt.h		fmt.h
main.cpp		main.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fast Matrix Transposition

Results of performance tests:

How-to-start tests:

To start performance test build and run following:

Example of output:

Format of output:

About

Releases

Packages

Languages

License

l0andr/fmt

Folders and files

Latest commit

History

Repository files navigation

Fast Matrix Transposition

Results of performance tests:

How-to-start tests:

To start performance test build and run following:

Example of output:

Format of output:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages