Automatic detection of database joinability risks
Google has a lot of data. This data lies in a large number of databases. Some of these databases should not be joined, notably for privacy or regulatory reasons. This talk will present an in-house framework based on sketching algorithms, that monitors for joinability risks in an automated and privacy-sensitive way at Google scale.