Gregory Todd Williams Graph Summaries Jesse Weaver

From Semantic Portal Wiki

Jump to: navigation, search
  • Question is for the Presentation: Gregory Todd Williams Graph Summaries
  • Question is asked by: Jesse Weaver
  • The Question is: Consider Figures 8a-f. For MD-Tree, the worst case shown is lg(error)=25, which means that error=2^25=33,554,432. How is it that MD-Tree estimated a frequency that deviated from the actually frequency by about 33.5x10^6 when there aren't even that many nodes or edges in the SwetoDBLP dataset? (or did I read something wrong?) In Figures 8g-h, the worst case for P-Tree is similar and on an even smaller dataset (TOntoGen).
  • Answer: The evaluation section is a mess. The axis labeling in figure 8 is unintelligible if, as you mention, the values are to be understood as lg(error) = ~25. Another issue in the evaluation is the (somewhat suspicious) use of a subset of SwetoDBLP that has only ~0.8 edges for every node (which seems like it might lead to a somewhat uninteresting graph).
Facts about Gregory Todd Williams Graph Summaries Jesse WeaverRDF feed
Question answerThe evaluation section is a mess. The axis The evaluation section is a mess. The axis labeling in figure 8 is unintelligible if, as you mention, the values are to be understood as lg(error) = ~25. Another issue in the evaluation is the (somewhat suspicious) use of a subset of SwetoDBLP that has only ~0.8 edges for every node (which seems like it might lead to a somewhat uninteresting graph). t lead to a somewhat uninteresting graph).
Question askedConsider Figures 8a-f. For MD-Tree, the w Consider Figures 8a-f. For MD-Tree, the worst case shown is lg(error)=25, which means that error=2^25=33,554,432. How is it that MD-Tree estimated a frequency that deviated from the actually frequency by about 33.5x10^6 when there aren't even that many nodes or edges in the SwetoDBLP dataset? (or did I read something wrong?) In Figures 8g-h, the worst case for P-Tree is similar and on an even smaller dataset (TOntoGen). and on an even smaller dataset (TOntoGen).
Question asked byJesse Weaver  +
Question for the PresentationGregory Todd Williams Graph Summaries  +
Personal tools
Semantic Web Community
Tetherless World constellation
maintenance