• +90 537 656 74 31
  • [email protected]
CONTACT US
تواصل معنا
  • العربية
  • Market Research
    • Feasibility studies
    • Marketing research
    • Products development
    • Testing customers experience
    • Studying customers satisfaction
    • Interactive business follow-up platform
  • SOCIO-POLITICS
    • Polls
    • Indicator’s development
    • Metrics development
    • Research partnership
    • Research statistical testing
  • Meal & TPM
    • Third-Party Monitoring (TPM)
    • Final Evaluation
    • Humanitarian needs assessment
    • Impact analysis
    • Market assessment
    • Development of MEAL policies & SOPs
    • Verification
  • Training
  • Agenda
    • Research studies
    • Guides
    • Interactive reports
    • Blog
    • FAQs
  • About us
FREE CONSULTATION

REQUEST A FREE CONSULTATION

    CONTACT INFORMATION

    İncilipınar Mah, Nail Bilen cd, Kantar İş Merkezi, kat 4, ofis 404, Şehitkamil, Gaziantep, Turkey

    +90 537 656 74 31

    [email protected]

    24/7 SUPPORT

    Unlimited help desk

    Menu
    • العربية
    Issues of Dealing with Missing Values

    Issues of Dealing with Missing Values

    A lot of data analysis programs do not have the ability to distinguish between many values, namely:
    · Missing Values
    · Blanks
    · Zero

    This weakness of data analysis programs also extends to the failure of many data analysts to distinguish between these values, therefore, these values are not being distinguished or dealt with, and data are not being analyzed based on these differences.

    Some may think that these differences are not very important, and they ignore them and leave dealing with them to data analysis programs, but in most cases, this gives catastrophic results that many people do not realize.

    I will attempt to illustrate these differences through some examples:

    1. If we want to analyze the average income of households in a country suffering from a crisis, it was noticed that a high percentage of respondents said that they have no income of any kind, and the percentage of these respondents is over 40% of the surveyed families. Data analysts dealt with these cases as missing values, the thing that gave results that are utterly different from the situation of society, as the socio-economic indicators in this case will show, for example, that only 10% of HHs are below the extreme poverty line, but the truth is that the percentage is more than 50%, because whoever does not have any income must be considered as his income is zero rather than a missing value, because the missing value is not included in the calculations, while the value zero is, and thus affects the percentages and the general average of income. In the opposite case, in the event of asking about the monthly salary, the salary of a person who does not have a job will be considered as a missing value rather than a zero, as he is unemployed and the salary is not calculated as a zero.
    2. Many programs do not consider the blanks in the text questions as a missing value. For example, we find that the SPSS program does not consider the empty cell in the text questions as a missing value, but rather considers it a valid value, as in the Gender column, if it is a text question, the program will calculate the empty values, the thing that significantly affects results such as percentages, knowing that those who did not indicate their gender (male or female) should be considered a missing value.
    3. In the SPSS, when trying to calculate a new data column from other columns, we find that some of the codes (formulas) can deal with the missing values effectively and some formulas cannot, for example when trying to calculate the total number of the family members out of the family members of each group, and we used the (sum) formula. We notice that SPSS gives the sum result even if there is a missing value in one of the categories, while calculating as a manual sum will give the sum result as a missing value when any of the cases with a missing value is encountered.

    The cases in which there are issues in defining the missing values are unlimited, and I do not advise in any case to give the data analysis program nor the data analyst alone the freedom to guess and deal with those values, as the appropriate treatment and definition of the empty value must be determined, as we explained in the income case, the missing value must be considered as zero, while in the salary case, it must be considered a missing value, and in our third example, the empty cells of any category of family members must be considered zero, knowing that from the beginning, data collectors must be told that if a family does not have any member of a certain category, they must not leave a missing value, rather, they should fill it with a zero.

    By:
    Ghaith Albahr: CEO of INDICATORS

    Share
    • Description
    • Reviews (0)
    Description

    Issues of Dealing with Missing Values
    A lot of data analysis programs do not have the ability to distinguish between many values, namely:
    · Missing Values
    · Blanks
    · Zero

    This weakness of data analysis programs also extends to the failure of many data analysts to distinguish between these values, therefore, these values are not being distinguished or dealt with, and data are not being analyzed based on these differences.

    Some may think that these differences are not very important, and they ignore them and leave dealing with them to data analysis programs, but in most cases, this gives catastrophic results that many people do not realize.

    I will attempt to illustrate these differences through some examples:
    Issues of Dealing with Missing Values
    1. If we want to analyze the average income of households in a country suffering from a crisis, it was noticed that a high percentage of respondents said that they have no income of any kind, and the percentage of these respondents is over 40% of the surveyed families. Data analysts dealt with these cases as missing values, the thing that gave results that are utterly different from the situation of society, as the socio-economic indicators in this case will show, for example, that only 10% of HHs are below the extreme poverty line, but the truth is that the percentage is more than 50%, because whoever does not have any income must be considered as his income is zero rather than a missing value, because the missing value is not included in the calculations, while the value zero is, and thus affects the percentages and the general average of income. In the opposite case, in the event of asking about the monthly salary, the salary of a person who does not have a job will be considered as a missing value rather than a zero, as he is unemployed and the salary is not calculated as a zero.
    2. Many programs do not consider the blanks in the text questions as a missing value. For example, we find that the SPSS program does not consider the empty cell in the text questions as a missing value, but rather considers it a valid value, as in the Gender column, if it is a text question, the program will calculate the empty values, the thing that significantly affects results such as percentages, knowing that those who did not indicate their gender (male or female) should be considered a missing value.
    3. In the SPSS, when trying to calculate a new data column from other columns, we find that some of the codes (formulas) can deal with the missing values effectively and some formulas cannot, for example when trying to calculate the total number of the family members out of the family members of each group, and we used the (sum) formula. We notice that SPSS gives the sum result even if there is a missing value in one of the categories, while calculating as a manual sum will give the sum result as a missing value when any of the cases with a missing value is encountered.

    The cases in which there are issues in defining the missing values are unlimited, and I do not advise in any case to give the data analysis program nor the data analyst alone the freedom to guess and deal with those values, as the appropriate treatment and definition of the empty value must be determined, as we explained in the income case, the missing value must be considered as zero, while in the salary case, it must be considered a missing value, and in our third example, the empty cells of any category of family members must be considered zero, knowing that from the beginning, data collectors must be told that if a family does not have any member of a certain category, they must not leave a missing value, rather, they should fill it with a zero.

    By:
    Ghaith Albahr: CEO of INDICATORS

    Reviews (0)

    Reviews

    There are no reviews yet.

    Be the first to review “Issues of Dealing with Missing Values” Cancel reply

    Your email address will not be published. Required fields are marked *

    Similar publications

    Syrians' Right to Legal Documents
    Close

    Syrians’ Right to Legal Documents

    Published on: 2019-05-01نُشِرَ بتاريخ: 2019-05-01
    UNEMPLOYMENT IN SAUDI ARABIA
    Close

    UNEMPLOYMENT IN SAUDI ARABIA

    Published on: 2021-10-20نُشِرَ بتاريخ: 2021-10-20
    Outliers Processing
    Close

    Outliers Processing

    Published on: 2022-05-20نُشِرَ بتاريخ: 2022-05-20
    THE INTEGRATION BETWEEN THE IDPs AND THE HOST COMMUNITY IN IDLEB CITY
    Close

    THE INTEGRATION BETWEEN THE IDPs AND THE HOST COMMUNITY IN IDLEB CITY

    Published on: 2019-04-01نُشِرَ بتاريخ: 2019-04-01
    منصة متابعة العمل
    Close

    Work follow-up platform

    Published on: 2021-12-09نُشِرَ بتاريخ: 2021-12-09
    Close

    Accountability to Affected Population AAP

    Published on: 2019-08-01نُشِرَ بتاريخ: 2019-08-01
    Field heroes 3 - focus group discussions
    Close

    focus group discussions – Field heroes 3

    Published on: 2018-11-03نُشِرَ بتاريخ: 2018-11-03
    PEOPLE WITH DISABILITIES’ NEEDS ASSESSMENT IN GAZIANTEP
    Close

    PEOPLE WITH DISABILITIES’ NEEDS ASSESSMENT IN GAZIANTEP

    Published on: 2019-07-01نُشِرَ بتاريخ: 2019-07-01

    Contact Us

    E-mail

    [email protected]
    E-MAIL US

    Telephone number

    (0090) 537 656 74 31

    WhatsApp

    indicators-home-WhatsApp QR

    FIND US ON:

    INDICATORS

    A research entity specialized in statistical analysis and data processing which provides services in market research and feasibility studies, TPM, polls, and social research.
    • AGENDA
      • Research Studies
      • Research Guides
      • Interactive Reports
      • Blog
      • FAQs
    • MARKET RESEARCH
      • Feasibility studies
      • Marketing research
      • Product development
      • Testing customer experience
      • See More
    • MEAL & TPM
      • TPM
      • Final Evaluation
      • Humanitarian needs assessment
      • Impact analysis
      • See More
    • SOCIO-POLITICS
      • Polls
      • Indicator’s development
      • Metrics development
      • Research partnership
      • Research statistical testing

    All rights reserved 2021-2022. INDICATORS ©

    MADE WITH ❤ BY FAWAZ ABBAS
    • Market Research
      • Feasibility studies
      • Marketing research
      • Products development
      • Testing customers experience
      • Studying customers satisfaction
      • Interactive business follow-up platform
    • SOCIO-POLITICS
      • Polls
      • Indicator’s development
      • Metrics development
      • Research partnership
      • Research statistical testing
    • Meal & TPM
      • Third-Party Monitoring (TPM)
      • Final Evaluation
      • Humanitarian needs assessment
      • Impact analysis
      • Market assessment
      • Development of MEAL policies & SOPs
      • Verification
    • Training
    • Agenda
      • Research studies
      • Guides
      • Interactive reports
      • Blog
      • FAQs
    • About us
    Start typing to see posts you are looking for.