Amazon Co-purchaseship Data


This dataset is originally taken from Jure Leskovec's collection of datasets at SNAP. We have cleaned the dataset and divided them into 4 categories of Books, Music, Videos and DVD and saved them in MATLAB format. You can find the 4 categories combined in amazon.mat.


  • A An NxN adjacency matrix
  • A_label Node names (names products)
  • F Attribute matrix. Each row is a node and each column is an attribute.
  • F_labelName of each attribute. Attributes can be genres, franchise name or many more.
  • dataset_nameThe name of the dataset.


If you find this dataset useful in your research, we ask that you cite the following paper:

    author = {Rezaei, Aria and Perozzi, Bryan and Akoglu, Leman},
    title = {Ties That Bind: Characterizing Classes by Attributes and Social Ties},
    booktitle = {Proceedings of the 26th International Conference on World Wide Web Companion},
    series = {WWW '17 Companion},
    year = {2017},
    isbn = {978-1-4503-4914-7},
    location = {Perth, Australia},
    pages = {973--981},
    numpages = {9},
    url = {},
    doi = {10.1145/3041021.3055138},
    acmid = {3055138},
    publisher = {International World Wide Web Conferences Steering Committee},
    address = {Republic and Canton of Geneva, Switzerland},
    keywords = {attributed graphs, community understanding, homophily, social networks, subspace discovery},