auctus |
Auctus |
other |
9 |
This project is a web crawler and search engine for datasets, specifically meant for data augmentation tasks in machine learning. It is able to find datasets in different repositories and index them for later retrieval. |
googledatastudio |
Google Data Studio |
other |
14 |
With Data Studio, you can easily report on data from a wide variety of sources, without programing. In just a few moments, you can connect to data sets such as: Databases, including BigQuery, MySQL, and PostgreSQL. Google Marketing Platform products, including Google Ads, Analytics, Display & Video 360, Search Ads 360. |
ibmguardium |
IBM Security Guardium Analyzer |
pii |
16 |
IBM Security Guardium Analyzer is a powerful service that can help you with this data security effort. Simple to use, Guardium Analyzer allows you to set up connections to your data sources - with no need to configure classification or risk scanning. |
metabase |
Metabase |
other |
15 |
Metabase is a simple and powerful analytics tool which lets anyone learn and make decisions from their company s data no technical knowledge required. |
metacrafter |
metacrafter |
detector |
48 |
Python command line tool and python engine to label table fields and fields in data files. It could help to find meaningful data in your tables and data files or to find Personal identifable information (PII). |
metacrafterpro |
metacrafterpro |
detector |
118 |
Cloud web service based on Metacrafter open source (https://github.com/apicrafter/metacrafter) available for beta testing by request |
metadata-guardian |
Metadata Guardian |
pii |
15 |
Metadata Guardian is a Python package that provides an easy way to protect your data sources by searching its metadata. By searching with data rules, it will detect what you are looking to protect. Using Rust, it makes blazing fast multi-regex matching. |
piicatcher |
PII Catcher |
pii |
11 |
PIICatcher is a scanner for PII and PHI information. It finds PII data in your databases and file systems and tracks critical data. |
presidio |
Microsoft Presidio |
pii |
28 |
Presidio (Origin from Latin praesidium protection, garrison) helps to ensure sensitive data is properly managed and governed. It provides fast identification and anonymization modules for private entities in text such as credit card numbers, names, locations, social security numbers, bitcoin wallets, US phone numbers, financial data and more. |
scrubadub |
scrubadub |
pii |
7 |
Remove personally identifiable information from free text. Sometimes we have additional metadata about the people we wish to anonymize. Other times we dont. This package makes it easy to seamlessly scrub personal information from free text, without compromising the privacy of the people we are trying to protect.` |
soda |
Soda |
other |
8 |
Soda allows everyone on your data team to find, analyze, and resolve data issues. Our open-source tools and data observability platform bring everyone closer to the data, resulting in data products that you can trust. |
sqldatacatalog |
RedGate SQL Data catalog |
catalogs |
21 |
SQL Data Catalog helps you protect your business by providing a scalable and repeatable process for managing personally-identifiable information (PII), as it moves through your SQL Server estate. Our solution lets you create a data catalog of your SQL Server estate, and monitors and controls it for protection appropriate to the sensitivity of the data, ensuring compliance during data handling. |