Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Understanding Data Representation: Binary, Numeric, Text Codes, and Compression, Summaries of Computer Networks

Data Structures and AlgorithmsDatabase SystemsDigital Logic and ElectronicsComputer Architecture

An introduction to data representation, focusing on the conversion of information into digital data using binary digits, numeric and text codes, and data compression techniques. Learn about the importance of data representation, digitization, and the evolution of digital devices, as well as the basics of binary number systems and various text encoding schemes.

What you will learn

  • How is data digitized and what is the process of converting information into binary digits?
  • What is data representation and why is it important?
  • What are the advantages and disadvantages of lossless and lossy data compression techniques?
  • How do you extract or unzip compressed files?
  • What are the different types of numeric and text codes used to represent data?

Typology: Summaries

2018/2019

Uploaded on 12/21/2019

hun-kimhong
hun-kimhong 🇰🇭

2 documents

1 / 13

Toggle sidebar

Related documents


Partial preview of the text

Download Understanding Data Representation: Binary, Numeric, Text Codes, and Compression and more Summaries Computer Networks in PDF only on Docsity! 1 CSC 170 – Introduction to Computers and Their Applications Lecture #1 – Digital Basics Data Representation • Data refers to the symbols that represent people, events, things, and ideas. Data can be a name, a number, the colors in a photograph, or the notes in a musical composition. • Data Representation refers to the form in which data is stored, processed, and transmitted. • Devices such as smartphones, iPods, and computers store data in digital formats that can be handled by electronic circuitry. 2 Data Representation • Digitization is the process of converting information, such as text, numbers, photo, or music, into digital data that can be manipulated by electronic devices. • The Digital Revolution has evolved through four phases, beginning with big, expensive, standalone computers, and progressing to today’s digital world in which small, inexpensive digital devices are everywhere. Data Representation • The 0s and 1s used to represent digital data are referred to as binary digits — from this term we get the word bit that stands for binary digit. • A bit is a 0 or 1 used in the digital representation of data. • A digital file, usually referred to simply as a file, is a named collection of data that exits on a storage medium, such as a hard disk, CD, DVD, or flash drive. 5 Representing Text • Digital devices employ several types of codes to represent character data, including ASCII, Unicode, and their variants. • ASCII (American Standard Code for Information Interchange, pronounced “ASK ee”) requires seven bits for each character. • The ASCII code for an uppercase A is 1000001. Representing Text • Extended ASCII is a superset of ASCII that uses eight bits for each character. • For example, Extended ASCII represents the uppercase letter A as 01000001. • Using eight bits instead of seven bits allows Extended ASCII to provide codes for 256 characters. 6 Representing Text • Unicode (pronounced “YOU ni code”) uses sixteen bits and provides codes or 65,000 characters. • This is a bonus for representing the alphabets of multiple languages. • UTF-8 is a variable-length coding scheme that uses seven bits for common ASCII characters but uses sixteen-bit Unicode as necessary. Representing Text 00100000 Space 00110011 3 01000110 F 01011001 Y 01101100 l 00100001 ! 00110100 4 01000111 G 01011010 Z 01101101 m 00100010 " 00110101 5 01001000 H 01011011 [ 01101110 n 00100011 # 00110110 6 01001001 I 01011100 \ 01101111 o 00100100 $ 00110111 7 01001010 J 01011101 ] 01110000 p 00100101 % 00111000 8 01001011 K 01011110 ^ 01110001 q 00100110 & 00111001 9 01001100 L 01011111 _ 01110010 r 00100111 ' 00111010 : 01001101 M 01100000 ` 01110011 s 00101000 ( 00111011 ; 01001110 N 01100001 a 01110100 t 00101001 ) 00111100 < 01001111 O 01100010 b 01110101 u 00101010 * 00111101 = 01010000 P 01100011 c 01110110 v 00101011 + 00111110 > 01010001 Q 01100100 d 01110111 w 00101100 , 00111111 ? 01010010 R 01100101 e 01111000 x 00101101 - 01000000 @ 01010011 S 01100110 f 01111001 y 00101110 . 01000001 A 01010100 T 01100111 g 01111010 z 00101111 / 01000010 B 01010101 U 01101000 h 01111011 { 00110000 0 01000011 C 01010110 V 01101001 i 01111100 | 00110001 1 01000100 D 01010111 W 01101010 j 01111101 } 00110010 2 01000101 E 01011000 X 01101011 k 01111110 ~ 7 Representing Text • ASCII codes are used for numerals, such as Social Security numbers and phone numbers. • Plain, unformatted text is sometimes called ASCII text and is stored in a so-called text file with a name ending in .txt. • On Apple devices these files are labeled “Plain Text.” In Windows, these files are labeled “Text Document”. Representing Text • ASCII text files contain no formatting. • To create documents with styles and formats, formatting codes have to be embedded in the text. 10 Bites and Bytes Bit One binary digit Byte 8 bits Kilobit 1,024 or 210 bits Kilobyte 1,024 or 210 bytes Megabit 1,048,576 or 220 bits Megabyte 1,048,576 or 220 bytes Gigabit 230 bits Gigabyte 230 bytes Terabyte 240 bytes Petabyte 250 bytes Exabyte 260 bytes Bites and Bytes • Use bits for data rates, such as Internet connection speeds, and movie download speeds. • Use bytes for file sizes and storage capacities. • 104 KB: Kilobyte (KB or Kbyte) is often used when referring to the size of small computer files. 11 Bites and Bytes • 56 Kbps: Kilobit (Kb or Kbit) can be used for slow data rates, such as a 56 Kbps (kilobits per second) dial-up connection. • 50 Mbps: Megabit (Mb or Mbit) is used for faster data rates, such as a 50 Mbps (megabits per second) Internet connection. Bites and Bytes • 3.2 MB: Megabyte (MB or MByte) is typically used when referring to the size of files containing photos and videos. • 100 Gbit: Gigabit (Gb or Gbit) is used for really fast network speeds. • 16 GB: Gigabyte (GB or GByte) is commonly used to refer to storage capacity 12 Data Compression • To reduce file size and transmission times, digital data can be compressed. • Data compression refers to any technique that recodes the data in a file so that it contains fewer bits. • Compression is commonly referred to as “zipping.” Data Compression • Compression techniques divided into two categories: lossless and lossy • Lossless compression provides a way to compress data and reconstitute it into its original state; uncompressed data stays exactly the same as the original data • Lossy compression throws away some of the original data during the compression process; uncompressed data is not exactly the same as the original
Docsity logo



Copyright © 2024 Ladybird Srl - Via Leonardo da Vinci 16, 10126, Torino, Italy - VAT 10816460017 - All rights reserved