SlideShare une entreprise Scribd logo
1  sur  19
Masudul Haque
•

A Regular expression is a pattern describing a
certain amount of text.

•

A regular expression, often called a pattern, is
an expression that describes a set of strings.
- Wikipedia
•
•
•
•
•
•

Matching/Finding
Doing something with matched text
Validation of data
Case insensitive matching
Parsing data ( ex: html )
Converting data into diff. form etc.






Pattern: To create a pattern, you must first invoke
one of its public static compile methods, which
will then return a Pattern object. These methods
accept a regular expression as the first argument.
Matcher: A Matcher object is the engine that
interprets the pattern and performs match
operations against an input string.
PatternSyntaxException: A PatternSyntaxException
object is an unchecked exception that indicates a
syntax error in a regular expression pattern.
Quote the next meta-character.

^

Match at the beginning

.

Match any character except new line

$

Match at the end, before new line

|

Alternation

()

Grouping

[]

Character class

{}

Match m to n times

+

One or more times

*

Zero or more times

?

Zero or one times
t
tab
n
newline
r
return
f
form feed
a
alarm (bell)
e
escape (think troff)
033
octal char
x1B
hex char
x{263a} long hex char
cK
control char
N{name} named Unicode character

(HT, TAB)
(LF, NL)
(CR)
(FF)
(BEL)
(ESC)
(example: ESC)
(example: ESC)
(example: Unicode SMILEY)
(example: VT)
Construct

Description

[abc]

a, b, or c (simple class)

[^abc]

Any character except a, b, or c (negation)

[a-zA-Z]

a through z, or A through Z, inclusive (range)

[a-d[m-p]]

a through d, or m through p: [a-dm-p] (union)

[a-z&&[def]]

d, e, or f (intersection)

[a-z&&[^bc]]

a through z, except for b and c: [ad-z]
(subtraction)

[a-z&&[^m-p]]

a through z, and not m through p: [a-lq-z]
(subtraction)
Construct

Descriptions

.

Any character (may or may not match line terminators)

d

A digit: [0-9]

D

A non-digit: [^0-9]

s

A whitespace character: [ tnx0Bfr]

S

A non-whitespace character: [^s]

w

A word character: [a-zA-Z_0-9]

W

A non-word character: [^w]
Construct

Description

p{Lower}

A lower-case alphabetic character: [a-z]

p{Upper}

An upper-case alphabetic character:[A-Z]

p{ASCII}

All ASCII:[x00-x7F]

p{Alpha}

An alphabetic character:[p{Lower}p{Upper}]

p{Digit}

A decimal digit: [0-9]

p{Alnum}

An alphanumeric character:[p{Alpha}p{Digit}]

p{Punct}

Punctuation: One of !"#$%&'()*+,-./:;<=>?@[]^_`{|}~

p{Graph}

A visible character: [p{Alnum}p{Punct}]

p{Print}

A printable character: [p{Graph}x20]

p{Blank}

A space or a tab: [ t]

p{Cntrl}

A control character: [x00-x1Fx7F]

p{XDigit}

A hexadecimal digit: [0-9a-fA-F]

p{Space}

A whitespace character: [ tnx0Bfr]
Construct

Description

p{javaLowerCase}

Equivalent to java.lang.Character.isLowerCase()

p{javaUpperCase}

Equivalent to java.lang.Character.isUpperCase()

p{javaWhitespace}

Equivalent to java.lang.Character.isWhitespace()

p{javaMirrored}

Equivalent to java.lang.Character.isMirrored()
Construct

Description

p{IsLatin}

A Latin script character (script)

p{InGreek}

A character in the Greek block (block)

p{Lu}

An uppercase letter (category)

p{IsAlphabetic}

An alphabetic character (binary property)

p{Sc}

A currency symbol

P{InGreek}

Any character except one in the Greek block
(negation)

[p{L}&&[^p{Lu}]]

Any letter except an uppercase letter (subtraction)






Greedy quantifiers are considered "greedy" because they force the
matcher to read in, or eat, the entire input string prior to attempting
the first match.
Reluctant quantifiers, however, take the opposite approach: They
start at the beginning of the input string, then reluctantly eat one
character at a time looking for a match. The last thing they try is the
entire input string.
Possessive quantifiers always eat the entire input string, trying once
(and only once) for a match. Unlike the greedy quantifiers,
possessive quantifiers never back off, even if doing so would allow
the overall match to succeed.
Greedy

Reluctant

Possessive

Meaning

X?

X??

X?+

X, once or not at
all

X*

X*?

X*+

X, zero or more
times

X+

X+?

X++

X, one or more
times

X{n}

X{n}?

X{n}+

X, exactly n times

X{n,}

X{n,}?

X{n,}+

X, at least n times

X{n,m}+

X, at least n but
not more
than m times

X{n,m}

X{n,m}?
Construct

Description

^

The beginning of a line

$

The end of a line

b

A word boundary

B

A non-word boundary

A

The beginning of the input

G

The end of the previous match

Z

The end of the input but for the final terminator, if any

z

The end of the input
Capturing groups are a way to treat multiple
characters as a single unit.
 int groupCount()
 int start()
 int end()
 String group(int)
Backreferences
Constant

Equivalent Embedded Flag Expression

Pattern.CANON_EQ

None

Pattern.CASE_INSENSITIVE

(?i)

Pattern.COMMENTS

(?x)

Pattern.MULTILINE

(?m)

Pattern.DOTALL

(?s)

Pattern.LITERAL

None

Pattern.UNICODE_CASE

(?u)

Pattern.UNIX_LINES

(?d)
Index Methods

Index methods provide useful index values that show

precisely where the match was found in the input
string:
 public int start(): Returns the start index of the
previous match.
 public int start(int group): Returns the start index of
the subsequence captured by the given group during
the previous match operation.
 public int end(): Returns the offset after the last
character matched.
 public int end(int group): Returns the offset after the
last character of the subsequence captured by the
given group during the previous match operation.
Study Methods









Study methods review the input string and return a

boolean indicating whether or not the pattern is found.
public boolean lookingAt(): Attempts to match the input
sequence, starting at the beginning of the region, against
the pattern.
public boolean find(): Attempts to find the next
subsequence of the input sequence that matches the
pattern.
public boolean find(int start): Resets this matcher and then
attempts to find the next subsequence of the input
sequence that matches the pattern, starting at the
specified index.
public boolean matches(): Attempts to match the entire
region against the pattern.
Replacement Methods

Replacement methods are useful methods for replacing text in an

input string.
 public Matcher appendReplacement(StringBuffer sb, String
replacement): Implements a non-terminal append-and-replace
step.
 public StringBuffer appendTail(StringBuffer sb): Implements a
terminal append-and-replace step.
 public String replaceAll(String replacement): Replaces every
subsequence of the input sequence that matches the pattern
with the given replacement string.
 public String replaceFirst(String replacement): Replaces the first
subsequence of the input sequence that matches the pattern
with the given replacement string.
 public static String quoteReplacement(String s): Returns a literal
replacement String for the specified String. This method
produces a String that will work as a literal replacement s in
the appendReplacement method of the Matcher class.

Contenu connexe

Tendances

This keyword in java
This keyword in javaThis keyword in java
This keyword in javaHitesh Kumar
 
Java 8 Lambda Built-in Functional Interfaces
Java 8 Lambda Built-in Functional InterfacesJava 8 Lambda Built-in Functional Interfaces
Java 8 Lambda Built-in Functional InterfacesGanesh Samarthyam
 
javascript objects
javascript objectsjavascript objects
javascript objectsVijay Kalyan
 
java 8 new features
java 8 new features java 8 new features
java 8 new features Rohit Verma
 
Function template
Function templateFunction template
Function templateKousalya M
 
Java collections concept
Java collections conceptJava collections concept
Java collections conceptkumar gaurav
 
Collections in Java
Collections in JavaCollections in Java
Collections in JavaKhasim Cise
 
Interface in java ,multiple inheritance in java, interface implementation
Interface in java ,multiple inheritance in java, interface implementationInterface in java ,multiple inheritance in java, interface implementation
Interface in java ,multiple inheritance in java, interface implementationHoneyChintal
 
Angular 8
Angular 8 Angular 8
Angular 8 Sunil OS
 
Classes, objects in JAVA
Classes, objects in JAVAClasses, objects in JAVA
Classes, objects in JAVAAbhilash Nair
 
JavaScript - Chapter 15 - Debugging Techniques
 JavaScript - Chapter 15 - Debugging Techniques JavaScript - Chapter 15 - Debugging Techniques
JavaScript - Chapter 15 - Debugging TechniquesWebStackAcademy
 
Javascript variables and datatypes
Javascript variables and datatypesJavascript variables and datatypes
Javascript variables and datatypesVarun C M
 
L11 array list
L11 array listL11 array list
L11 array listteach4uin
 

Tendances (20)

Delegates and events in C#
Delegates and events in C#Delegates and events in C#
Delegates and events in C#
 
JAVA OOP
JAVA OOPJAVA OOP
JAVA OOP
 
This keyword in java
This keyword in javaThis keyword in java
This keyword in java
 
Jquery
JqueryJquery
Jquery
 
Java 8 Lambda Built-in Functional Interfaces
Java 8 Lambda Built-in Functional InterfacesJava 8 Lambda Built-in Functional Interfaces
Java 8 Lambda Built-in Functional Interfaces
 
javascript objects
javascript objectsjavascript objects
javascript objects
 
C# operators
C# operatorsC# operators
C# operators
 
java 8 new features
java 8 new features java 8 new features
java 8 new features
 
Function template
Function templateFunction template
Function template
 
Java collections concept
Java collections conceptJava collections concept
Java collections concept
 
07 java collection
07 java collection07 java collection
07 java collection
 
Collections in Java
Collections in JavaCollections in Java
Collections in Java
 
Interface in java ,multiple inheritance in java, interface implementation
Interface in java ,multiple inheritance in java, interface implementationInterface in java ,multiple inheritance in java, interface implementation
Interface in java ,multiple inheritance in java, interface implementation
 
Angular 8
Angular 8 Angular 8
Angular 8
 
Classes, objects in JAVA
Classes, objects in JAVAClasses, objects in JAVA
Classes, objects in JAVA
 
JavaScript - Chapter 15 - Debugging Techniques
 JavaScript - Chapter 15 - Debugging Techniques JavaScript - Chapter 15 - Debugging Techniques
JavaScript - Chapter 15 - Debugging Techniques
 
Strings in Java
Strings in JavaStrings in Java
Strings in Java
 
C# Strings
C# StringsC# Strings
C# Strings
 
Javascript variables and datatypes
Javascript variables and datatypesJavascript variables and datatypes
Javascript variables and datatypes
 
L11 array list
L11 array listL11 array list
L11 array list
 

Similaire à Java: Regular Expression

Regular expressions
Regular expressionsRegular expressions
Regular expressionsRaj Gupta
 
regular-expression.pdf
regular-expression.pdfregular-expression.pdf
regular-expression.pdfDarellMuchoko
 
Regular Expression
Regular ExpressionRegular Expression
Regular ExpressionBharat17485
 
SQL for pattern matching (Oracle 12c)
SQL for pattern matching (Oracle 12c)SQL for pattern matching (Oracle 12c)
SQL for pattern matching (Oracle 12c)Logan Palanisamy
 
unit-4 regular expression.pptx
unit-4 regular expression.pptxunit-4 regular expression.pptx
unit-4 regular expression.pptxPadreBhoj
 
Python - Regular Expressions
Python - Regular ExpressionsPython - Regular Expressions
Python - Regular ExpressionsMukesh Tekwani
 
JavaScript - Chapter 9 - TypeConversion and Regular Expressions
 JavaScript - Chapter 9 - TypeConversion and Regular Expressions  JavaScript - Chapter 9 - TypeConversion and Regular Expressions
JavaScript - Chapter 9 - TypeConversion and Regular Expressions WebStackAcademy
 
Regular expressions in oracle
Regular expressions in oracleRegular expressions in oracle
Regular expressions in oracleLogan Palanisamy
 
Regular_Expressions.pptx
Regular_Expressions.pptxRegular_Expressions.pptx
Regular_Expressions.pptxDurgaNayak4
 
regex.pptx
regex.pptxregex.pptx
regex.pptxqnuslv
 
Regular expressions in Python
Regular expressions in PythonRegular expressions in Python
Regular expressions in PythonSujith Kumar
 
16 Java Regex
16 Java Regex16 Java Regex
16 Java Regexwayn
 
Python regular expressions
Python regular expressionsPython regular expressions
Python regular expressionsKrishna Nanda
 

Similaire à Java: Regular Expression (20)

Adv. python regular expression by Rj
Adv. python regular expression by RjAdv. python regular expression by Rj
Adv. python regular expression by Rj
 
Regular expressions
Regular expressionsRegular expressions
Regular expressions
 
regular-expression.pdf
regular-expression.pdfregular-expression.pdf
regular-expression.pdf
 
Regular Expression
Regular ExpressionRegular Expression
Regular Expression
 
SQL for pattern matching (Oracle 12c)
SQL for pattern matching (Oracle 12c)SQL for pattern matching (Oracle 12c)
SQL for pattern matching (Oracle 12c)
 
unit-4 regular expression.pptx
unit-4 regular expression.pptxunit-4 regular expression.pptx
unit-4 regular expression.pptx
 
2.regular expressions
2.regular expressions2.regular expressions
2.regular expressions
 
Strings
StringsStrings
Strings
 
Python - Regular Expressions
Python - Regular ExpressionsPython - Regular Expressions
Python - Regular Expressions
 
Computer programming 2 Lesson 12
Computer programming 2  Lesson 12Computer programming 2  Lesson 12
Computer programming 2 Lesson 12
 
JavaScript - Chapter 9 - TypeConversion and Regular Expressions
 JavaScript - Chapter 9 - TypeConversion and Regular Expressions  JavaScript - Chapter 9 - TypeConversion and Regular Expressions
JavaScript - Chapter 9 - TypeConversion and Regular Expressions
 
Regular expressions in oracle
Regular expressions in oracleRegular expressions in oracle
Regular expressions in oracle
 
PHP Web Programming
PHP Web ProgrammingPHP Web Programming
PHP Web Programming
 
Regular_Expressions.pptx
Regular_Expressions.pptxRegular_Expressions.pptx
Regular_Expressions.pptx
 
WD programs descriptions.docx
WD programs descriptions.docxWD programs descriptions.docx
WD programs descriptions.docx
 
Regex1.1.pptx
Regex1.1.pptxRegex1.1.pptx
Regex1.1.pptx
 
regex.pptx
regex.pptxregex.pptx
regex.pptx
 
Regular expressions in Python
Regular expressions in PythonRegular expressions in Python
Regular expressions in Python
 
16 Java Regex
16 Java Regex16 Java Regex
16 Java Regex
 
Python regular expressions
Python regular expressionsPython regular expressions
Python regular expressions
 

Plus de Masudul Haque

Plus de Masudul Haque (6)

Websocket
WebsocketWebsocket
Websocket
 
Java 9 new features
Java 9 new featuresJava 9 new features
Java 9 new features
 
Java 8 lambda
Java 8 lambdaJava 8 lambda
Java 8 lambda
 
Java-7: Collections
Java-7: CollectionsJava-7: Collections
Java-7: Collections
 
Java-7 Concurrency
Java-7 ConcurrencyJava-7 Concurrency
Java-7 Concurrency
 
Basic java
Basic javaBasic java
Basic java
 

Dernier

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 

Dernier (20)

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 

Java: Regular Expression

  • 2. • A Regular expression is a pattern describing a certain amount of text. • A regular expression, often called a pattern, is an expression that describes a set of strings. - Wikipedia
  • 3. • • • • • • Matching/Finding Doing something with matched text Validation of data Case insensitive matching Parsing data ( ex: html ) Converting data into diff. form etc.
  • 4.    Pattern: To create a pattern, you must first invoke one of its public static compile methods, which will then return a Pattern object. These methods accept a regular expression as the first argument. Matcher: A Matcher object is the engine that interprets the pattern and performs match operations against an input string. PatternSyntaxException: A PatternSyntaxException object is an unchecked exception that indicates a syntax error in a regular expression pattern.
  • 5. Quote the next meta-character. ^ Match at the beginning . Match any character except new line $ Match at the end, before new line | Alternation () Grouping [] Character class {} Match m to n times + One or more times * Zero or more times ? Zero or one times
  • 6. t tab n newline r return f form feed a alarm (bell) e escape (think troff) 033 octal char x1B hex char x{263a} long hex char cK control char N{name} named Unicode character (HT, TAB) (LF, NL) (CR) (FF) (BEL) (ESC) (example: ESC) (example: ESC) (example: Unicode SMILEY) (example: VT)
  • 7. Construct Description [abc] a, b, or c (simple class) [^abc] Any character except a, b, or c (negation) [a-zA-Z] a through z, or A through Z, inclusive (range) [a-d[m-p]] a through d, or m through p: [a-dm-p] (union) [a-z&&[def]] d, e, or f (intersection) [a-z&&[^bc]] a through z, except for b and c: [ad-z] (subtraction) [a-z&&[^m-p]] a through z, and not m through p: [a-lq-z] (subtraction)
  • 8. Construct Descriptions . Any character (may or may not match line terminators) d A digit: [0-9] D A non-digit: [^0-9] s A whitespace character: [ tnx0Bfr] S A non-whitespace character: [^s] w A word character: [a-zA-Z_0-9] W A non-word character: [^w]
  • 9. Construct Description p{Lower} A lower-case alphabetic character: [a-z] p{Upper} An upper-case alphabetic character:[A-Z] p{ASCII} All ASCII:[x00-x7F] p{Alpha} An alphabetic character:[p{Lower}p{Upper}] p{Digit} A decimal digit: [0-9] p{Alnum} An alphanumeric character:[p{Alpha}p{Digit}] p{Punct} Punctuation: One of !"#$%&'()*+,-./:;<=>?@[]^_`{|}~ p{Graph} A visible character: [p{Alnum}p{Punct}] p{Print} A printable character: [p{Graph}x20] p{Blank} A space or a tab: [ t] p{Cntrl} A control character: [x00-x1Fx7F] p{XDigit} A hexadecimal digit: [0-9a-fA-F] p{Space} A whitespace character: [ tnx0Bfr]
  • 10. Construct Description p{javaLowerCase} Equivalent to java.lang.Character.isLowerCase() p{javaUpperCase} Equivalent to java.lang.Character.isUpperCase() p{javaWhitespace} Equivalent to java.lang.Character.isWhitespace() p{javaMirrored} Equivalent to java.lang.Character.isMirrored()
  • 11. Construct Description p{IsLatin} A Latin script character (script) p{InGreek} A character in the Greek block (block) p{Lu} An uppercase letter (category) p{IsAlphabetic} An alphabetic character (binary property) p{Sc} A currency symbol P{InGreek} Any character except one in the Greek block (negation) [p{L}&&[^p{Lu}]] Any letter except an uppercase letter (subtraction)
  • 12.    Greedy quantifiers are considered "greedy" because they force the matcher to read in, or eat, the entire input string prior to attempting the first match. Reluctant quantifiers, however, take the opposite approach: They start at the beginning of the input string, then reluctantly eat one character at a time looking for a match. The last thing they try is the entire input string. Possessive quantifiers always eat the entire input string, trying once (and only once) for a match. Unlike the greedy quantifiers, possessive quantifiers never back off, even if doing so would allow the overall match to succeed.
  • 13. Greedy Reluctant Possessive Meaning X? X?? X?+ X, once or not at all X* X*? X*+ X, zero or more times X+ X+? X++ X, one or more times X{n} X{n}? X{n}+ X, exactly n times X{n,} X{n,}? X{n,}+ X, at least n times X{n,m}+ X, at least n but not more than m times X{n,m} X{n,m}?
  • 14. Construct Description ^ The beginning of a line $ The end of a line b A word boundary B A non-word boundary A The beginning of the input G The end of the previous match Z The end of the input but for the final terminator, if any z The end of the input
  • 15. Capturing groups are a way to treat multiple characters as a single unit.  int groupCount()  int start()  int end()  String group(int) Backreferences
  • 16. Constant Equivalent Embedded Flag Expression Pattern.CANON_EQ None Pattern.CASE_INSENSITIVE (?i) Pattern.COMMENTS (?x) Pattern.MULTILINE (?m) Pattern.DOTALL (?s) Pattern.LITERAL None Pattern.UNICODE_CASE (?u) Pattern.UNIX_LINES (?d)
  • 17. Index Methods Index methods provide useful index values that show precisely where the match was found in the input string:  public int start(): Returns the start index of the previous match.  public int start(int group): Returns the start index of the subsequence captured by the given group during the previous match operation.  public int end(): Returns the offset after the last character matched.  public int end(int group): Returns the offset after the last character of the subsequence captured by the given group during the previous match operation.
  • 18. Study Methods      Study methods review the input string and return a boolean indicating whether or not the pattern is found. public boolean lookingAt(): Attempts to match the input sequence, starting at the beginning of the region, against the pattern. public boolean find(): Attempts to find the next subsequence of the input sequence that matches the pattern. public boolean find(int start): Resets this matcher and then attempts to find the next subsequence of the input sequence that matches the pattern, starting at the specified index. public boolean matches(): Attempts to match the entire region against the pattern.
  • 19. Replacement Methods Replacement methods are useful methods for replacing text in an input string.  public Matcher appendReplacement(StringBuffer sb, String replacement): Implements a non-terminal append-and-replace step.  public StringBuffer appendTail(StringBuffer sb): Implements a terminal append-and-replace step.  public String replaceAll(String replacement): Replaces every subsequence of the input sequence that matches the pattern with the given replacement string.  public String replaceFirst(String replacement): Replaces the first subsequence of the input sequence that matches the pattern with the given replacement string.  public static String quoteReplacement(String s): Returns a literal replacement String for the specified String. This method produces a String that will work as a literal replacement s in the appendReplacement method of the Matcher class.