AI and Ethical use of data: Data Validity Part 1
March 13, 2019
As mentioned yesterday, I had the pleasure of taking a course on Ethics and Data Science. Given that data science is a key part of the machine learning area of AI, I thought I would expand on the subject as a starting point. Each bullet below deserves a more detailed discussion of its own. But I truly believe that ethical and business conduct requirements will be needed for all AI projects in order to provide transparency and explainability.
So what are the risk considerations for ethics in AI? You will notice that there are overlapping considerations that have to be thought through and included.
- Data Validity
- Algorithm Fairness
- Informed Consent
- Model Errors
- Societal Impact
- Ossification/Rigidity of ML models
- Surveillance Impacts
- Managing Change
- Regression
- Bias/Variance
So, as an example, take data validity. I am starting to see this criterion being included, since it carries a high risk of legal ramifications. In this day and age you have access to third-party data sets (obtained legally or illegally) as well as the data sets that you have collected as an organization. Are you questioning where the data comes from, and whether proper “informed consent” was given by the individuals who provided that data or information? Did the third-party organizations thoroughly vet and validate the information? Has it been modified, redacted or scrambled? Can you still identify individuals or information by extrapolation? What proof will stand up in court if you are sued for accessing information that was not properly vetted by a third party? Are you moving data from one line of business to another (sales to marketing, for example), and are you violating any agreements that you have with clients or leads?
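One of the questions above, whether individuals can still be identified by extrapolation, can be partly checked in code. The following is a minimal sketch, not a complete privacy audit: it assumes the records are plain dictionaries and that you already know which fields act as quasi-identifiers. The function name `smallest_group_size`, the field names and the sample records are all hypothetical and are there only for illustration. The idea is borrowed from k-anonymity: if some combination of field values appears only once, that record may point to a single person even with the name removed.

```python
from collections import Counter


def smallest_group_size(records, quasi_identifiers):
    """Return the size of the smallest group of records that share the
    same values for the given quasi-identifier fields.

    A result of 1 means at least one record is unique on those fields
    and could potentially be re-identified. Returns 0 for an empty data set.
    """
    groups = Counter(
        tuple(record[field] for field in quasi_identifiers)
        for record in records
    )
    return min(groups.values()) if groups else 0


# Hypothetical "redacted" data set: names removed, other fields kept.
records = [
    {"zip": "90210", "birth_year": 1980, "gender": "F"},
    {"zip": "90210", "birth_year": 1980, "gender": "F"},
    {"zip": "10001", "birth_year": 1975, "gender": "M"},
]

k = smallest_group_size(records, ["zip", "birth_year", "gender"])
if k < 2:
    print("At least one record is unique on these fields; review before use.")
```

A check like this only covers the fields you pass in. It says nothing about consent or where the data came from, which still have to be answered by people and paperwork.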
The course I took was put together a few years ago and had a very optimistic tone, holding that regulation would slow down innovation or drown skills. The professor's recommended direction at that time was: don’t surprise people with outcomes, and be able to explain how the model got to an outcome, but leave how and what we analyze to the data scientist. I think we will see regulations step in. Any time you have a practice (medicine, law, engineering, real estate or accounting), regulation has to be in place to protect human rights and the individual human.