2021-06-01 Dogs

Data on Dogs in Switzerland.

The data is from Zurich, Switzerland, courtesy of the internet. The idea to analyze this dataset comes from a post on Kaggle.

Translation

German is the official language of Zurich. Most of the columns are in German, but the translations to English aren’t too tricky. Use a translator if you find something you do not understand.

  • ALTER -> Age
  • GESCHLECHT -> Gender (w = weiblich, female; m = männlich, male)
  • STADTKREIS -> City Quarter or District
  • RASSE1 -> Dog’s Primary Breed
  • RASSE2 -> Dog’s Secondary Breed
  • GEBURTSJAHR_HUND -> Dog’s Year of Birth
  • GESCHLECHT_HUND -> Dog’s Gender
  • HUNDEFARBE -> Dog’s Color

Data

Dog data from Zurich. There should be a notebook available to load the data and do some initial cleaning.

Questions

We will use 2015 to begin. Later we can combine all of the information into a single data frame to see if more can be learned by combining data from multiple years.

  • How many dogs are there?
  • How many male vs female dogs?
  • Analyze the ages of the owners?
  • The birth year of the dog has some obvious errors. Find them and fix them. See the fixing code included in the notebook.
  • Does id uniquely identify the dog registration information? Explain your reasoning and give supporting evidence.
Last modified August 18, 2023: 2022-2023 End State (7352e87)