Skip to content

์•„๋ผ

arabae edited this page Jun 2, 2021 · 16 revisions

๋ชฉ์ฐจ
06/03
06/02
06/01
05/31
05/30
05/29
05/28
05/27
05/26
05/25
05/24


06/01

๐Ÿ“Š Experiments

  • user์˜ ๊ฐ™์€ ์‹œํ—˜์ง€๋ฅผ ์—ฐ๋‹ฌ์•„ ๋ณด๋Š” ๊ฒฝ์šฐ, ์˜ˆ์™ธ๋ฅผ ์ฒ˜๋ฆฌํ•˜์ง€ ๋ชปํ–ˆ๋˜ ์˜ค๋ฅ˜ ํ•ด๊ฒฐ

  • ๋‚ด๊ฐ€ ์ถ”๊ฐ€ํ•œ feature๋ฅผ ์‚ฌ์šฉํ•˜๋˜ All data๋Œ€์‹  Train data๋งŒ ์‚ฌ์šฉํ•˜์—ฌ ์„ฑ๋Šฅ ํ™•์ธ

    • ์„ฑ๋Šฅ ํ–ฅ์ƒ! (0.7641 โ†’ 0.7661, 0.002!)
    Valid AUC LB
    0.7706 0.7661
  • FEATURE ์ •๋ฆฌ์— ์˜ฌ๋ ค๋‘” ๋ชจ๋“  ํŠน์ง• ์‚ฌ์šฉํ•˜์—ฌ LSTM ๋ชจ๋ธ์— ์‚ฌ์šฉ

    • CUDA error๊ฐ€ ๋ฐœ์ƒํ•ด์„œ, ์šฐ์„  ๋ฏผ์šฉ+์•„๋ผ feature๋งŒ ์ถ”๊ฐ€ํ•˜์—ฌ ์‹คํ—˜ ์ง„ํ–‰
    • ์˜คํžˆ๋ ค ์„ฑ๋Šฅ์ด ์ €ํ•˜๋˜์—ˆ์Œ
    Valid AUC LB
    0.7701 0.7626
  • SAINT ๋…ผ๋ฌธ abtract, introduction๋ณด๊ณ  ์ด์ „๊นŒ์ง€ ์—ฐ๊ตฌ ํ๋ฆ„ ํŒŒ์•…ํ•˜๊ธฐ

05/31

๐Ÿ“Š Experiments

  • Feature ์ถ”๊ฐ€
    • userTestTime: ์ฒซ ๋ฌธ์ œ๋ฅผ ํ’€๊ธฐ ์‹œ์ž‘ํ•œ ์‹œ๊ฐ„์œผ๋กœ๋ถ€ํ„ฐ ํ˜„์žฌ ๋ฌธ์ œ๋ฅผ ํ’€๊ธฐ๊นŒ์ง€ ๊ฒฝ๊ณผ์‹œ๊ฐ„
    • userTestContAnswer: user์˜ ์‹œํ—˜์ง€ ๋ณ„ ์—ฐ์† ์ •๋‹ต ์ˆ˜
    • userTestContWrong: user์˜ ์‹œํ—˜์ง€ ๋ณ„ ์—ฐ์† ์˜ค๋‹ต ์ˆ˜

05/30

๐Ÿ“Š Experiments

  • Feature ์ถ”๊ฐ€
    • userTestAnswer: user์˜ ํ’€๊ณ ์žˆ๋Š” ์‹œํ—˜์ง€ ์ค‘ ์ด์ „๊นŒ์ง€ ์ •๋‹ต์„ ๋งž์ถ˜ ๊ฐœ์ˆ˜
    • userTestWrong: user์˜ ํ’€๊ณ ์žˆ๋Š” ์‹œํ—˜์ง€ ์ค‘ ์ด์ „๊นŒ์ง€ ์ •๋‹ต์„ ํ‹€๋ฆฐ ๊ฐœ์ˆ˜

05/29

๐Ÿ“Š Experiments

  • ํ˜„์žฌ๊นŒ์ง€ user๊ฐ€ ๋ฌธ์ œ๋ฅผ ํ‘ธ๋Š”๋ฐ ์†Œ์š”๋œ ํ‰๊ท  ์‹œ๊ฐ„ feature๋ฅผ ํฌํ•จํ•˜์—ฌ ์‹คํ—˜
  • hyperparameter๋Š” ๋ฏผ์šฉ์ด์˜ best model์„ ๊ทธ๋Œ€๋กœ ์‚ฌ์šฉ
  • userSolTime ํŠน์ง• ์ถ”๊ฐ€, ๊ธฐ์กด์˜ ํŠน์ง• ํ•จ๊ป˜ ์‚ฌ์šฉ (categorical: 3, continuous: 7)
  • 10-fold (์ฐธ๊ณ ํ•œ hyperparameter๋ฅผ ํ•˜๋‚˜์”ฉ ์ˆ˜์ •ํ•˜๋ฉด์„œ ์‹คํ—˜)
Seed Valid AUC LB
42 0.7624 0.7634

05/28

๐Ÿ… Peer-session

  • user๊ฐ€ ํ•˜๋‚˜์˜ ์‹œํ—˜์„ ๋๋‚ธ ์‹œ์ ์„ ์ •ํ™•ํžˆ ๊ตฌ๋ถ„ํ•˜์ž!
    • ์ด ๋ถ€๋ถ„์ด ์˜ˆ์™ธ๊ฐ€ ์žˆ์„ ๊ฒƒ ๊ฐ™์•„์„œ ์—ฌ๋Ÿฌ๋ฒˆ ์ด์•ผ๊ธฐํ•ด๋ณธ ๊ฒฐ๊ณผ ๋ฏผ์šฉ์ด๊ฐ€ ์•„์ฃผ ์ข‹์€ ์ฝ”๋“œ๋ฅผ ํ†ตํ•ด ๊ณ ๋ คํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•จ (discussion ์ฐธ๊ณ ํ•˜๊ธฐ!)
  • ์žฌํ›ˆ์˜ค๋น ๊ฐ€ ์˜ฌ๋ฆฐ feature idea๋ฅผ ๋‹ค๋ฅธ ํŒ€์›๋“ค๋„ ๋‚˜๋ˆ ์„œ ์ง„ํ–‰ํ•ด๋ณด๋„๋ก ํ•จ

๋‚ด๊ฐ€ ๋งก์€ ๋ถ€๋ถ„
5. ํ˜„์žฌ๊นŒ์ง€ user๊ฐ€ ๋ฌธ์ œ๋ฅผ ํ‘ธ๋Š”๋ฐ ์†Œ์š”๋œ ํ‰๊ท  ์‹œ๊ฐ„
6. ํ•œ ์‹œํ—˜์ง€๋ฅผ ํ’‚์— ์žˆ์–ด์„œ ์ด ๊ฑธ๋ฆฐ ์‹œ๊ฐ„
7. ์ฒซ ๋ฌธ์ œ๋ฅผ ํ’€๊ธฐ ์‹œ์ž‘ํ•œ ์‹œ๊ฐ„์œผ๋กœ๋ถ€ํ„ฐ ํ˜„์žฌ ๋ฌธ์ œ๋ฅผ ํ’€๊ธฐ๊นŒ์ง€ ๊ฒฝ๊ณผ์‹œ๊ฐ„
8. ์ด์ „๊นŒ์ง€ ์ •๋‹ต์„ ์—ฐ์†์œผ๋กœ ๋ช‡๋ฒˆ ๋งžํ˜”๋Š”์ง€ - ํ˜„์žฌ ๋ณด๊ณ  ์žˆ๋Š” ์‹œํ—˜์— ํ•œํ•ด์„œ ์นด์šดํŒ…
9. ์ด์ „๊นŒ์ง€ ์ •๋‹ต์„ ์—ฐ์†์œผ๋กœ ๋ช‡๋ฒˆ ํ‹€๋ ธ๋Š”์ง€ - ํ˜„์žฌ ๋ณด๊ณ  ์žˆ๋Š” ์‹œํ—˜์— ํ•œํ•ด์„œ ์นด์šดํŒ…
10. ํ•™์ƒ์˜ ์‹œํ—˜์ง€ ๋ณ„ ์ •๋‹ต๋ฅ 

๐Ÿ”” Mentoring

  • feature๋ฅผ ๊ณ ๋ คํ•˜๋Š”๊ฒŒ ๊ฐ€์žฅ ์ค‘์š”ํ•˜๋‹ค๊ณ  ๋Š๊ปด์ง
    • ์–ด๋–ค feature๋ฅผ ์‚ฌ์šฉํ–ˆ๋Š”์ง€ ์น˜์—ดํ•˜๊ฒŒ ํ† ๋ก ํ•˜๋Š”๊ฒŒ ์ข‹์„ ๊ฒƒ ๊ฐ™์Œ
  • output์— ๋Œ€ํ•œ ๋ถ„์„์ด ์•„์ฃผ ์ค‘์š”!!
    • ๊ฐ๊ฐ ์ฐพ์•„์„œ ํ”ผ์–ด์„ธ์…˜์—์„œ ๊ณต์œ ํ•˜๋Š”๊ฒŒ ์ข‹์„ ๊ฒƒ ๊ฐ™์Œ
    • ๋‚˜๋ˆ ์„œ ํ•ด๋ณด๊ณ , ๊ฐ™์ด ์ด์•ผ๊ธฐ

05/27

๐Ÿ“Š Experiments

  • feature๋ฅผ ์ ์  ์ถ”๊ฐ€ํ•  ๋•Œ๋งˆ๋‹ค ๋ณ€๊ฒฝํ•ด์ค˜์•ผํ•˜๋Š” ๊ณณ์ด dataloader.py, trainer.py, model.py ๋กœ ๋„ˆ๋ฌด ๋งŽ์€ ๋ถ€๋ถ„์„ ๊ฑด๋“œ๋ ค์•ผํ•ด์„œ ์ด ๋ถ€๋ถ„์„ ์‹คํ—˜์— ์šฉ์ดํ•˜๋„๋ก ๊ตฌ์กฐ๋ฅผ ์ˆ˜์ •
  • dataloader.py์˜ load_data_from_file ํ•จ์ˆ˜์•ˆ์„ ๋‹ค์Œ๊ณผ ๊ฐ™์ด ๋ณ€๊ฒฝ
    • n_cates, n_cons, cate_embs ๋ณ€์ˆ˜๋ฅผ ๋งŒ๋“ค์–ด์„œ model ๊ตฌ์กฐ๋˜ํ•œ ์œ ๋™์ ์œผ๋กœ ๋ณ€๊ฒฝ์ด ๊ฐ€๋Šฅํ•˜๋„๋กํ•จ!
# =============================== !!!!์—ฌ๊ธฐ๋งŒ ์ฃผ์˜ํ•˜์ž!!!! ===============================      
columns = ["userID", "testId", "assessmentItemID", "KnowledgeTag", 
                   "correctRate", "correctAnswer", "totalAnswer", "userAcc", "test_mean", "tag_mean", "answerCode"]
args.n_cates = 3
args.n_cons = 6
# ========================================================================================
args.cate_embs = []
for c in columns[1: args.n_cates+1]:
		args.cate_embs.append(len(np.load(os.path.join(self.args.asset_dir, f"{c}_classes.npy"))))
        
args.n_cates += 1
args.cate_embs.append(3)
  • ์ถ”๊ฐ€๋กœ ํ•„์š”ํ•œ ์ˆ˜์ •์ด ํ•„์š”ํ•œ ๋ถ€๋ถ„์€ ๋ฏผ์šฉ์ด๊ฐ€ ๋งก์•„์„œ ์ง„ํ–‰ํ•˜๊ธฐ๋กœ ํ•จ
    • embedding layer๋งˆ๋‹ค hidden_dim์˜ ์ฐจ์›์„ ๋‹ค๋ฅด๊ฒŒ ํ•ด์ค€๋‹ค๋˜์ง€, LayerNorm ์ถ”๊ฐ€ ๋“ฑ

05/26

๐Ÿ… Peer-session

  • ์ƒ๊ฐ๋ณด๋‹ค ์‹คํ—˜ํ–ˆ๋˜ ๋ฐ์ดํ„ฐ ๋ถ„๋ฆฌ ์กฐ๊ฑด์ด ์ข‹์ง€ ์•Š์•˜์Œ..๐Ÿ˜ฅ
    • ์ด์ „ ์‹คํ—˜์—์„œ ์ •ํ•œ seed ๊ฐ’์ด model์—๋งŒ ์ ์šฉ๋œ๊ฑฐ๋ผ ๋ฐ์ดํ„ฐ์˜ ๋ถ„๋ฆฌ๋˜๋Š” ์กฐ๊ฑด์ด ๋‹ฌ๋ผ์ง€๋ฉด์„œ valid์™€ LB์˜ ๊ฐ„๊ฒฉ์ด ๋„“์–ด์ง„๊ฑด๊ฐ€?
    • seed๋ฅผ 42๋กœ ๋‘๊ณ  ๋Œ๋ ค๋„ 406๊ณผ ๋ณ„๋กœ ์ฐจ์ด๊ฐ€ ์—†์—ˆ์Œ!
    • ์ด๊ฑฐ๋Š” ์กฐ๊ฑด์ด ์ข‹์ง€ ์•Š์•˜์„ ํ™•๋ฅ ์ด ๋” ํฐ ๊ฒƒ ๊ฐ™๋‹ค..

๐Ÿ“Š Experiments

  • user์˜ ์‹œํ—˜์ง€(testID)๋งˆ๋‹ค ์ •๋‹ต๋ฅ  feature๋ฅผ ์ถ”๊ฐ€ํ–ˆ์„ ๋•Œ, categorical๋กœ ์ƒ๊ฐํ•˜์—ฌ์„œ embedding layer๋ฅผ ๊ฑฐ์น˜๊ณ  ๋‹ค๋ฅธ feature๋“ค๊ณผ ํ•ฉ์ณ์„œ(concat) ์‚ฌ์šฉํ–ˆ๋Š”๋ฐ ์˜ค๋Š˜ ํŒ€์›๋“ค์—๊ฒŒ๋„ ๋ฌผ์–ด๋ณธ ๊ฒฐ๊ณผ continuousํ•œ ํŠน์ง•์ธ ๊ฒƒ ๊ฐ™๋‹ค๊ณ  ๊ฒฐ๋ก ์„ ๋‚ด๋ฆผ!

  • LGBM baseline code์— ์žˆ๋˜ feature 2๊ฐœ์™€ ์–ด์ œ ์ถ”๊ฐ€ํ–ˆ๋˜ feature๋ฅผ ๋ฐ”๋กœ linear layer์— ๋„ฃ์€ ๋’ค embedding feature๋“ค๊ณผ ํ•ฉ์ณ์„œ comb_proj layer๋ฅผ ํ†ต๊ณผํ•˜๋„๋ก ๋ณ€๊ฒฝํ•˜์—ฌ ์‹คํ—˜์„ ์ง„ํ–‰ โ–ถ baseline (0.7361) > add_correctRatio (0.7342) > add_3features (0.7333)๋ฅผ ๋ณด์ž„ ๐Ÿ˜ข

  • ์•„์ง๊นŒ์ง€ embedding layer~comb_proj layer ๋ถ€๋ถ„์„ ์ดํ•ดํ•˜์ง€ ๋ชปํ•œ ๊ฒƒ ๊ฐ™๋‹ค! ์ด ๋ถ€๋ถ„ ๋” ํ™•์ธํ•˜๊ธฐ

05/25

๐Ÿ… Peer-session

  • data๋ฅผ ๋ถ„ํ• ํ•  ๋•Œ, seed๊ฐ’์€ 0์œผ๋กœ ๊ณ ์ •๋˜์–ด ์žˆ์—ˆ์Œ
  • model์˜ seed๊ฐ’์€ ๊ฐ€์žฅ ์ข‹์€ ์„ฑ๋Šฅ์„ ๋ณด์ด๊ณ  ์ ์€ ์ฐจ์ด์˜€๋˜ "406"์œผ๋กœ ๊ฒฐ์ •

๐Ÿ’ซ Ideas

  • ์–ด๋–ค feature๊ฐ€ ๋„์›€์ด ๋ ๊นŒ? "๋‚ด๊ฐ€ ๋ชจ๋ธ์ด๋ผ๊ณ  ์ƒ๊ฐํ•ด๋ณด์„ธ์š”"

    • ๊ฒฐ๊ตญ user์˜ history๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ next๋ฅผ ๋งž์ถ”๋Š”๊ฒƒ์ด๊ธฐ ๋•Œ๋ฌธ์— history์—์„œ ์ถฉ๋ถ„ํ•œ feature๊ฐ€ ์žˆ๋‹ค๋ฉด ๋ชจ๋ธ์—๊ฒŒ ์œ ์šฉํ•  ๊ฒƒ ๊ฐ™์Œ
    • ์ด์ „ ๋ฌธ์ œ์—์„œ ๊ฑธ๋ ธ๋˜ ์‹œ๊ฐ„? ํ˜น์€ ํ˜„์žฌ๊นŒ์ง€ user๊ฐ€ ๋ฌธ์ œ๋ฅผ ํ‘ธ๋Š”๋ฐ ์†Œ์š”๋œ ํ‰๊ท  ์‹œ๊ฐ„์„ ์•Œ๋ฉด ์—ด์‹ฌํžˆ ํ‘ธ๋Š” ์‚ฌ๋žŒ๊ณผ ๊ทธ๋ ‡์ง€ ์•Š์€ ์‚ฌ๋žŒ(์ฐ๋Š” ๊ฒฝ์šฐ)๋ฅผ ๊ตฌ๋ถ„ํ•  ์ˆ˜ ์žˆ์ง€ ์•Š์„๊นŒ?
    • user์˜ ํ•œ ์ •๋ณด๋งˆ๋‹ค ์ •๋‹ต๋ฅ ์„ ์ถ”๊ฐ€๋กœ ๋„ฃ์–ด์ฃผ๋ฉด ์ข‹์ง€ ์•Š์„๊นŒ? -- ์กฐ๊ธˆ ๋” ์ถ”๊ฐ€ํ•œ๋‹ค๋ฉด, user์˜ ๋ฌธ์ œ๋‚œ์ด๋„(9๊ฐœ)๋ณ„ ์ •๋‹ต๋ฅ ?
  • ์–ด๋–ค๊ฑธ ์‹œ๋„ํ•ด๋ณผ๊นŒ?

    • ๋ฌธ์ œ๋ณ„ ๋‚œ์ด๋„, user์˜ time๋งˆ๋‹ค ์ •๋‹ต๋ฅ , ํ˜„์žฌ๊นŒ์ง€ user๊ฐ€ ๋ฌธ์ œ๋ฅผ ํ‘ธ๋Š”๋ฐ ์†Œ์š”๋œ ํ‰๊ท  ์‹œ๊ฐ„

๐Ÿ“Š Experiments

  • user์˜ ์‹œํ—˜์ง€(testID)๋งˆ๋‹ค ์ •๋‹ต๋ฅ  feature๋ฅผ ์ถ”๊ฐ€ํ•˜์—ฌ ์‹คํ—˜ ์ง„ํ–‰ (seed: 406) โ–ถ baseline ์„ฑ๋Šฅ์ธ 0.7361๋ณด๋‹ค ํ•˜๋ฝํ•œ 0.7342๋ฅผ ๋ณด์ž„, ์ด๋•Œ Valid AUC: 0.7624

๐Ÿ’ก class

  1. ํ‰๊ท ์ ์œผ๋กœ ๋ฌธ์ œ๋ฅผ ํ‘ธ๋Š”๋ฐ ์†Œ์š”๋˜๋Š” ์‹œ๊ฐ„๊ณผ ๊ฐ user์˜ ๋ฌธ์ œ ํ’€์ด ์‹œ๊ฐ„์˜ ์ฐจ์ด๋„ ํ•˜๋‚˜์˜ feature๊ฐ€ ๋  ์ˆ˜ ์žˆ์Œ
  2. ์ ‘๊ทผ๋ฐฉ๋ฒ• ๋ฌธ์ œ, ๋ฌธํ•ญ, tag์— ๋Œ€ํ•œ ์ •๋‹ต๋ฅ ์„ ํ•˜๋‚˜์˜ feature๋กœ๋„ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์Œ

05/24

๐Ÿ“Š Experiments
Seed๊ฐ’์— ๋”ฐ๋ฅธ LB์™€ Validation ์„ฑ๋Šฅ ์ฐจ์ด ํ™•์ธ ์‹คํ—˜

์‚ฌ์šฉํ•œ seed list - 28, 81, 1109, 1996, 8888

Seed Valid AUC LB ์ฐจ์ด
28 0.7381 0.7234 0.0597
81 0.7389 0.7207 0.0182
1109 0.7336 0.7241 0.0095
1996 0.7366 0.7178 0.0188
8888 0.7378 0.7227 0.0151

โ–ถ 1109๋ฅผ seed๋กœ ์‚ฌ์šฉํ•˜์˜€์„ ๋•Œ, ๊ฐ€์žฅ ์ข‹์€ ์„ฑ๋Šฅ์„ ๋ณด์ด๊ณ  ๊ทธ ์ฐจ์ด๋„ ๊ฐ€์žฅ ์ ์—ˆ์Œ