Convert text file to CSV with additional column for variance

•

0 j'aime•1,133 vues

The document describes converting a text file into a CSV file with an additional column for variance. It provides Perl source code to open the source and target files, parse the source file line by line, extract the year and count values into columns, and calculate the variance between counts which is also added as a column in the target CSV file. The output file uses tabs as separators between columns and shows the first 10 lines as an example of the transformed flat database file with year, count, and variance columns.

Technologie

Converting a text file into a CSV file
with an additional column/field
The first 11 lines of source text file
look like this --> 1990
41502 Year

1991
The objective is to
transform the file into a flat
41820
database file containing Village
the following columns: 1992 Count
Year, Count, and 41876
Variance. Variance is the
difference between the 1993
count in a row and that in
the previous row. 41931

Perl Source Code
#!/usr/bin/perl

use strict;

my $year;
my $brgy_count_1 = 0;
my $brgy_count_2 = 0;
my $counter = 1;
my $first_line = 1;

open SOURCE_FILE, quot;/path/to/source_filequot;;
open TARGET_FILE, quot;>/path/to/target_filequot;;

$Perl Source Code (2) while (<SOURCE_FILE>) { next if ($_ =~ m/^$/); if ($counter == 1 or $counter == 3) { $_ =~ m/(d{4})/; print TARGET_FILE $1 . quot;tquot;; $counter += 1; } else { $_ =~ m/(d+)$/; if ($first_line == 0) { $brgy_count_2 = $1; } print TARGET_FILE $1 . quot;tquot;. ($brgy_count_2 - $brgy_count_1) . quot;nquot;;$

$Perl Source Code (3) if ($first_line == 1) { $brgy_count_1 = $1; } else { $brgy_count_1 = $brgy_count_2; } if ($counter == 4) { $counter = 1; } else { $counter += 1; } $first_line = 0; } } close SOURCE_FILE; close TARGET_FILE;$

The new flat database file
The first 10 lines of output file
look like this --> 1990 41502 0
1991 41820 318
1992 41876 56
The output file uses a
1993 41931 55
tab as field/column 1994 41919 -12
separator. To use a 1995 41929 10
comma, just go to the 1996 41935 6
related code's line 1997 41939 4
and change the 1998 41940 1
separator from “t” to
“,”. 1999 41940 0

Village
Year Variance
Count

Contenu connexe

En vedette

Educational System in the Philippines, Quality Education and Access to EducationRose Ann Enriquez

Historical perspective of the philippine educational system lee annJerson Panopio

K to 12 electrical teacher's guideNoel Tan

The Organizational Structure of the Philippine Educational SystemGlance Ruiz

Philippine education presentationCarlo Magno

K to 12 bread and pastry teacher's guideNoel Tan

Problems and Issues in the Philippine Educational SystemJames Paglinawan

Education System of the PhilippinesCarms Celis

Deped K12Daniel Bragais

DepEd, CHED and TESDArajnulada

K to 12 General PresentationDepEdPhilippines

K to 12 Science Curriculum GuideDr. Joy Kenneth Sala Biasong

K to 12 classroom assessment pptCarlo Magno

Historical foundation of philippine education Michael John Labog

K 12 basic education program19710802

En vedette (15)

Educational System in the Philippines, Quality Education and Access to Education

Historical perspective of the philippine educational system lee ann

K to 12 electrical teacher's guide

The Organizational Structure of the Philippine Educational System

Philippine education presentation

K to 12 bread and pastry teacher's guide

Problems and Issues in the Philippine Educational System

Education System of the Philippines

Deped K12

DepEd, CHED and TESDA

K to 12 General Presentation

K to 12 Science Curriculum Guide

K to 12 classroom assessment ppt

Historical foundation of philippine education

K 12 basic education program

Dernier

"ML in Production",Oleksandr BaganFwdays

Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro

My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer

Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst

Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz

"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays

WordPress Websites for Engineers: Elevate Your Brandgvaughan

Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos

Powerpoint exploring the locations used in television show Time Clashcharlottematthew16

My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar

Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity

Training state-of-the-art general text embeddingZilliz

Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi

What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett

Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada

SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero

AI as an Interface for Commercial BuildingsMemoori

Install Stable Diffusion in windows machinePadma Pradeep

Dernier (20)

"ML in Production",Oleksandr Bagan

Unraveling Multimodality with Large Language Models.pdf

My INSURER PTE LTD - Insurtech Innovation Award 2024

Human Factors of XR: Using Human Factors to Design XR Systems

Vector Databases 101 - An introduction to the world of Vector Databases

"Debugging python applications inside k8s environment", Andrii Soldatenko

WordPress Websites for Engineers: Elevate Your Brand

Scanning the Internet for External Cloud Exposures via SSL Certs

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)

Powerpoint exploring the locations used in television show Time Clash

My Hashitalk Indonesia April 2024 Presentation

Dev Dives: Streamline document processing with UiPath Studio Web

Training state-of-the-art general text embedding

Vertex AI Gemini Prompt Engineering Tips

What's New in Teams Calling, Meetings and Devices March 2024

Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

SIP trunking in Janus @ Kamailio World 2024

AI as an Interface for Commercial Buildings

Install Stable Diffusion in windows machine

Convert text file to CSV with additional column for variance

1. Converting a text file into a CSV file with an additional column/field The first 11 lines of source text file look like this --> 1990 41502 Year 1991 The objective is to transform the file into a flat 41820 database file containing Village the following columns: 1992 Count Year, Count, and 41876 Variance. Variance is the difference between the 1993 count in a row and that in the previous row. 41931

2. Perl Source Code #!/usr/bin/perl use strict; my $year; my $brgy_count_1 = 0; my $brgy_count_2 = 0; my $counter = 1; my $first_line = 1; open SOURCE_FILE, quot;/path/to/source_filequot;; open TARGET_FILE, quot;>/path/to/target_filequot;;

3. Perl Source Code (2) while (<SOURCE_FILE>) { next if ($_ =~ m/^$/); if ($counter == 1 or $counter == 3) { $_ =~ m/(d{4})/; print TARGET_FILE $1 . quot;tquot;; $counter += 1; } else { $_ =~ m/(d+)$/; if ($first_line == 0) { $brgy_count_2 = $1; } print TARGET_FILE $1 . quot;tquot;. ($brgy_count_2 - $brgy_count_1) . quot;nquot;;

4. Perl Source Code (3) if ($first_line == 1) { $brgy_count_1 = $1; } else { $brgy_count_1 = $brgy_count_2; } if ($counter == 4) { $counter = 1; } else { $counter += 1; } $first_line = 0; } } close SOURCE_FILE; close TARGET_FILE;

5. The new flat database file The first 10 lines of output file look like this --> 1990 41502 0 1991 41820 318 1992 41876 56 The output file uses a 1993 41931 55 tab as field/column 1994 41919 -12 separator. To use a 1995 41929 10 comma, just go to the 1996 41935 6 related code's line 1997 41939 4 and change the 1998 41940 1 separator from “t” to “,”. 1999 41940 0 Village Year Variance Count

Convert text file to CSV with additional column for variance

Recommandé

Recommandé

Contenu connexe

En vedette

En vedette (15)

Dernier

Dernier (20)

Convert text file to CSV with additional column for variance