The Second Eigen Value And Its Applications

'Project Overview'

This project investigates the second eigenvalue of the Google Matrix and its applications, particularly in detecting link spamming and enhancing PageRank stability. The project is based on the paper "The Second Eigenvalue of the Google Matrix" by Taher H. Haveliwala and Sepandar D. Kamvar. The primary objective is to analytically determine the modulus of the second eigenvalue for the web hyperlink matrix used by Google for computing PageRank.

Introduction

This report analyzes the second eigenvalue of the Google Matrix. Specifically, it proves the following statement: "For any matrix ( A = [cP + (1 − c)E]^T ), where ( P ) is an ( n × n ) row-stochastic matrix, ( E ) is a non-negative ( n × n ) rank-one row-stochastic matrix, and ( 0 ≤ c ≤ 1 ), the second eigenvalue of ( A ) has modulus ( |λ_2| ≤ c ). Furthermore, if ( P ) has at least two irreducible closed subsets, the second eigenvalue ( λ_2 = c )."

This has implications for the convergence rate of PageRank, its stability to link structure perturbations, the detection of Google spammers, and the design of algorithms to speed up PageRank.

Preliminaries and Notations

( P ): ( n × n ) row-stochastic matrix.
( E ): ( n × n ) rank-one row-stochastic matrix, ( E = ev^T ), where ( e ) is an ( n )-vector with elements ( e_i = 1 ).
( A ): Column-stochastic matrix, ( A = [cP + (1− c)E]^T ).
( λ_i ): ( i )-th eigenvalue of ( A ).
( x_i ): Corresponding eigenvector of ( λ_i ).

Results

Theorem 1

For ( P ) as an ( n × n ) stochastic matrix and ( c ) such that ( 0 ≤ c ≤ 1 ), with ( E ) a rank-one row-stochastic matrix ( E = ev^T ), the eigenvalue ( |λ_2| ≤ c ).

Theorem 2

If ( P ) has at least two irreducible closed subsets, then the second eigenvalue ( λ_2 = c ).

Applications

Link Spamming

Link spamming manipulates the PageRank algorithm to artificially boost the ranking of websites. This section discusses how the second eigenvalue helps in detecting such manipulative tactics.

Energy in PageRank

Introduces the concept of energy in PageRank, which relates to the distribution and retention of PageRank within a network of web pages.

Methods to Boost PageRank

Two methods are described to increase PageRank:

Method 1: Addition of promotion nodes.
Method 2: Creation of an irreducible closed subset.

Irreducible Closed Subsets and Link Spamming

Analyzes the impact of irreducible closed subsets on PageRank and their role in link spamming.

Second Eigenvector and Link Spamming

Examines how the second eigenvector aids in detecting abnormalities in PageRank caused by link spamming.

Algorithms

Block Power Method

Algorithm to compute the second eigenvector using the block power method.

Simple Power Method

Algorithm to compute the second eigenvector using the simple power method.

Work Distribution

Details the distribution of work among team members.

Appendix

Contains supplementary materials and proofs of theorems and lemmas discussed in the report.

References

List of references used in the project.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
The Second Eigen Value And Its Applications.pdf		The Second Eigen Value And Its Applications.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Second Eigen Value And Its Applications

'Project Overview'

Table of Contents

Introduction

Preliminaries and Notations

Results

Theorem 1

Theorem 2

Applications

Link Spamming

Energy in PageRank

Methods to Boost PageRank

Irreducible Closed Subsets and Link Spamming

Second Eigenvector and Link Spamming

Algorithms

Block Power Method

Simple Power Method

Work Distribution

Appendix

References

About

Releases

Packages

Nandini-Jaiswal/The-Second-Eigen-Value-And-Its-Applications

Folders and files

Latest commit

History

Repository files navigation

The Second Eigen Value And Its Applications

'Project Overview'

Table of Contents

Introduction

Preliminaries and Notations

Results

Theorem 1

Theorem 2

Applications

Link Spamming

Energy in PageRank

Methods to Boost PageRank

Irreducible Closed Subsets and Link Spamming

Second Eigenvector and Link Spamming

Algorithms

Block Power Method

Simple Power Method

Work Distribution

Appendix

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages