谢英豪

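Works: 1 | Citations: 0 | H-index: 0
Affiliation: School of Computer Science and Engineering, Southeast University
Funding: National Natural Science Foundation of China
Related field: Automation and Computer Technology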

Min-wise hash function-based sampling over distributed data streams
2009
To avoid redundant and inconsistent information in distributed data streams, a sampling method based on min-wise hash functions is designed, and the practical semantics of the union of distributed data streams is defined. First, for each family of min-wise hash functions, the data item with the minimum hash value is selected as the local sample, which filters out the bias caused by frequent updates in a single data stream. Second, for the same hash function, the local samples are combined at the center node and the one with the minimum hash value is selected as the global sample, which filters out the bias caused by duplicated updates. Finally, based on the obtained uniform samples, several aggregations under the defined semantics of the union of data streams are precisely estimated. Comparison tests on synthetic and real-life data streams demonstrate the effectiveness of this method.
Authors: 崇志宏, 倪巍伟, 徐立臻, 吕建华, 谢英豪
Keywords: AGGREGATION
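
The two-level sampling described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`make_hash`, `local_sample`, `global_sample`, `estimate_fraction`) are hypothetical, a seeded SHA-256 digest stands in for a true min-wise hash family, and the fraction estimator is only one assumed example of an aggregate over the union of streams.

```python
import hashlib
from typing import Callable, Hashable, List, Optional, Sequence

HashFn = Callable[[Hashable], int]


def make_hash(seed: int) -> HashFn:
    """Build a seeded hash function (an assumed stand-in for a min-wise hash)."""
    def h(item: Hashable) -> int:
        digest = hashlib.sha256(f"{seed}:{item!r}".encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big")
    return h


def local_sample(stream: Sequence[Hashable],
                 hashes: Sequence[HashFn]) -> List[Optional[Hashable]]:
    """Per hash function, keep the item with the minimum hash value.

    Repeated updates of the same item hash to the same value, so update
    frequency within one stream does not influence which item is kept.
    """
    return [min(stream, key=h, default=None) for h in hashes]


def global_sample(local_samples: Sequence[Sequence[Optional[Hashable]]],
                  hashes: Sequence[HashFn]) -> List[Optional[Hashable]]:
    """At the center node, merge local samples hash function by hash function."""
    merged: List[Optional[Hashable]] = []
    for k, h in enumerate(hashes):
        candidates = [s[k] for s in local_samples if s[k] is not None]
        merged.append(min(candidates, key=h, default=None))
    return merged


def estimate_fraction(samples: Sequence[Optional[Hashable]],
                      predicate: Callable[[Hashable], bool]) -> float:
    """Estimate the share of distinct items in the union satisfying `predicate`."""
    kept = [x for x in samples if x is not None]
    if not kept:
        return 0.0
    return sum(1 for x in kept if predicate(x)) / len(kept)


if __name__ == "__main__":
    hashes = [make_hash(seed) for seed in range(64)]
    node_a = [1, 1, 1, 2, 3, 3]      # item 1 is updated frequently
    node_b = [3, 4, 4, 5, 1]         # items 1 and 3 also appear on node A
    merged = global_sample(
        [local_sample(node_a, hashes), local_sample(node_b, hashes)], hashes
    )
    print(estimate_fraction(merged, lambda x: x % 2 == 0))
```

Because each slot holds the minimum over the distinct items seen, merging local samples gives the same result as sampling the deduplicated union directly, which is why repeated updates within or across nodes do not skew the sample in this sketch.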