Merge PMDK to Master #7

pyrito · 2020-03-29T17:31:28Z

Reason for this PR

Wanted to integrate PMDK changes to the master branch to ensure recoverability of data structures allocated in persistent memory. Currently still working on P-CLHT. Transactions are included but are yet to be verified. See issue #5

How to enable PM?

Install PMDK

$ cd pmdk
$ git checkout tags/1.6
$ make -j
$ cd ..

Emulate PM with Ext4-DAX mount

$ sudo mount -o dax /dev/pmem0 /mnt/pmem

Set pool_size and pool name in clht_lb_res.c. TODO: instructions to set up environment variables instead.
Make accordingly and run the example.

Checklist

Changes are clean and readable; make logical sense
Changes are correct, passes tests
API changes are reflected in README
Any required documentation is changed explicitly

Still some bugs to be hashed out: - Seg fault thrown when clht_gc_destroy is called - PM file unable to handle larger sizes for keys - Need to implement PM logic for buckets

Some bugs: - For larger sizes of n, still having pmemobj_alloc errors

vijay03

Could we change README.md to explain how to use the pmdk branch please?

Could we also add a new pmdk.md to explain the porting effort at a high level?

SeKwonLee

Need changes to properly use PMDK transactions or to employ non-transactional APIs instead only to resolve memory leak problems. There are two separate APIs provided by libpmemobj: non-transactional API and transactional. If you employ transactional API across the entire range of updates, we don't need to employ RECIPE conversions at all such as manually adding flushes after each atomic store. PMDK transaction would transparently handle everything to ensure the crash-consistent updates of data structures by using undo logging. Instead of employing transactional API, we may be able to use mini-transactional features provided in non-transactional APIs to handle memory leak problems. (Please check following these links: link1, link2, link3).
Another issue we need to address is the default PMDK pointer size (16 bytes). Different from traditional dynamic 8-byte pointers, libpmemobj uses 16-byte pointers consisting of uuid and offset, which indicate the universal identifier (8 bytes) of memory-mapped files and the offset (8 bytes) from the starting virtual address of the memory-mapped file respectively. If we simply replace 8-byte pointers with 16-byte PMDK pointers, we can lose the benefits of the cache-efficient layout of original RECIPE indexes. For example, in our current modification, we simply replace the "next" pointer in bucket_t structure with PMEMoid (16bytes) type variable. After this modification, the bucket size is changed from 64 bytes (cache line size) to 72 bytes, so this will lead bucket allocations unaligned by cache lines. uuid is used to distinguish different memory pools and should be employed for the specific applications allocating objects from multiple memory pools. However, as RECIPE indexes only need to use a single memory pool, we can initialize uuid globally at program startup and can store offsets only as pointers in the objects of data structures.

vijay03 · 2020-03-31T15:00:00Z

Hi @pyrito, I think I had a different pull request in mind, just talking about the existence of the PMDK branch in README + explaining how to setup and use PMDK with Recipe. The actual details of the implementation can stay within this branch, so regardless of the implementation, master's README.md should have a pointer to this branch.

pyrito · 2020-03-31T17:53:13Z

Hi @vijay03 ,

Was your intention to have another branch with README modifications instead?

vijay03 · 2020-03-31T17:55:36Z

no, directly in master. So you would have to submit a different pull request.

Edited the main README to refer to the PMDK branch in the "limitations" section. Created a PMDK README for P-CLHT.

1. Merge recent masstree modifications from master

1. Add system requirements: the huge performance drop in PMDK was observed in old kernel versions (v4.X). After changing the kernel to latest version (v5.3), it becomes performing well. 2. Change the instructions of installing pmdk to use latest-stable branch.

pyrito · 2020-04-18T02:20:00Z

No need to merge the code. Branches are made to be separate.

pyrito added 12 commits March 6, 2020 15:41

Initial pmdk pushes for clht_lb_res

f0e2205

Refactored & bug

8aaf5c2

Fixed seg fault

922b912

Main logic working with PMDK

1d56564

Still some bugs to be hashed out: - Seg fault thrown when clht_gc_destroy is called - PM file unable to handle larger sizes for keys - Need to implement PM logic for buckets

Implemented PM clean-up, solved segfault

1982610

Some bugs: - For larger sizes of n, still having pmemobj_alloc errors

Modified clean-up in clht_gc.c, found resizing bug

298bf59

Fixed PMEMpool size, still having segfault

6a5a4d4

Solved seg fault for hashtable. WIP

b3ead0c

Replicated bug

b859a6b

Fixed seg fault, still issue with ssmem

5ec9a47

Changed clht_open func param

2e07aca

First implementation of transactions

fd33b64

pyrito requested a review from SeKwonLee March 29, 2020 17:31

vijay03 requested changes Mar 30, 2020

View reviewed changes

SeKwonLee requested changes Mar 30, 2020

View reviewed changes

Trimmed use of transactions. WIP.

824f5e2

pyrito and others added 11 commits March 31, 2020 21:16

Working on cache alignment fix. WIP.

bd951e6

Cache-line alignment fixed.

9b605d9

Added README information for PMDK

5dda496

Edited the main README to refer to the PMDK branch in the "limitations" section. Created a PMDK README for P-CLHT.

Added transaction configurability

bbdd250

Documentation changes

f4243fe

Merge branch 'master' into pmdk

03c7b32

1. Merge recent masstree modifications from master

Initial commit applying PMDK to masstree

fb11b10

Integrated correct transactions

9782868

Merge branch 'pmdk' of https://github.com/utsaslab/RECIPE into pmdk

4f6a839

[P-Masstree] Update compile options and minor changes

77bc50d

Update pmdk document

c8182e3

1. Add system requirements: the huge performance drop in PMDK was observed in old kernel versions (v4.X). After changing the kernel to latest version (v5.3), it becomes performing well. 2. Change the instructions of installing pmdk to use latest-stable branch.

SeKwonLee and others added 13 commits April 12, 2020 18:34

[P-CLHT] update compile options

1ec6380

Fixed bug in clht_lb_res.c

3d82a49

Merge branch 'master' into pmdk

af8106d

✨ add the new feature to reload masstree

ed00f17

Merge branch 'master' into pmdk

0405b2d

Merge branch 'master' into pmdk

8d3b00a

Exchange free to pmemobj_free

270bb09

Added some previously deleted code

3514403

Merge branch 'pmdk' of https://github.com/utsaslab/RECIPE into pmdk

7006004

Integrate clht_open to clht_create

8d7af8a

Remove comments related to DIMM

d495e35

Merge branch 'master' into pmdk

058e0b1

Modified README for P-CLHT

03af92d

pyrito closed this Apr 18, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge PMDK to Master #7

Merge PMDK to Master #7

pyrito commented Mar 29, 2020 •

edited by vijay03

Loading

vijay03 left a comment

SeKwonLee left a comment

vijay03 commented Mar 31, 2020

pyrito commented Mar 31, 2020

vijay03 commented Mar 31, 2020

pyrito commented Apr 18, 2020

Merge PMDK to Master #7

Merge PMDK to Master #7

Conversation

pyrito commented Mar 29, 2020 • edited by vijay03 Loading

Reason for this PR

How to enable PM?

Checklist

vijay03 left a comment

Choose a reason for hiding this comment

SeKwonLee left a comment

Choose a reason for hiding this comment

vijay03 commented Mar 31, 2020

pyrito commented Mar 31, 2020

vijay03 commented Mar 31, 2020

pyrito commented Apr 18, 2020

pyrito commented Mar 29, 2020 •

edited by vijay03

Loading