DBLP Co-authorship Data


This data is completely gathered by ourselves. Nodes are computer scientists in DBLP records and edges represent a co-authorship incident.


  • A An NxN adjacency matrix
  • A_label Node names (names of authors)
  • F Attribute matrix. Each row is a node and each column is an attribute.
  • F_labelName of each attribute. Attributes are venues of publication in CS.
  • dataset_nameThe name of the dataset.


If you find this dataset useful in your research, we ask that you cite the following paper:

    author = {Rezaei, Aria and Perozzi, Bryan and Akoglu, Leman},
    title = {Ties That Bind: Characterizing Classes by Attributes and Social Ties},
    booktitle = {Proceedings of the 26th International Conference on World Wide Web Companion},
    series = {WWW '17 Companion},
    year = {2017},
    isbn = {978-1-4503-4914-7},
    location = {Perth, Australia},
    pages = {973--981},
    numpages = {9},
    url = {https://doi.org/10.1145/3041021.3055138},
    doi = {10.1145/3041021.3055138},
    acmid = {3055138},
    publisher = {International World Wide Web Conferences Steering Committee},
    address = {Republic and Canton of Geneva, Switzerland},
    keywords = {attributed graphs, community understanding, homophily, social networks, subspace discovery},