<div dir="ltr"><div><div>Hello Everyone,<br></div>Due to popular request our reading group will be <b><span style="background-color:rgb(255,229,153)">for tomorrow, Wednesday at 11 am</span></b> in the MAT conference room.<br>
</div>Hope to see you there!<br><br><div><b>Talk Abstract:</b></div><div><span style="white-space:pre-wrap"> </span>All
pairs similarity search is used in many applications such as duplicate
detections, data cleaning and clustering. It is still a time consuming
process and in this proposal I present our contributions to speeding up
the process using exact and efficient parallel executions. These
techniques can be summarized as data partitioning, load balancing and
memory optimizations. They have shown to add an order of magnitude over
state of the art algorithms for similarity search using five datasets.</div><div><br></div><div><b>Keywords:</b> <br>Modeling, algorithms, performance</div><br><div class="gmail_extra"><br><br>
</div></div>