Regular Expressions in Python

 



Regular expressions (regex) describe patterns in text, which lets you search, extract, and replace substrings by shape rather than by exact value. Python's standard library provides the re module for working with them.


Basic Functions in the re Module
  • re.search(pattern, string): Scans the string and returns a match object for the first location where the pattern matches, else None.
  • re.match(pattern, string): Checks for a match only at the beginning of the string. Returns a match object if found, else None.
  • re.findall(pattern, string): Returns a list of all non-overlapping matches of the pattern in the string. If the pattern contains capturing groups, the list holds the groups instead of the whole matches.
  • re.finditer(pattern, string): Returns an iterator yielding match objects over all non-overlapping matches.
  • re.sub(pattern, repl, string): Returns a new string in which every non-overlapping match is replaced by repl. The original string is not modified.

Using re.search

import re

# Search for a pattern within a string
pattern = r'\bhello\b'
text = 'hello world'
match = re.search(pattern, text)

if match:
    print('Match found:', match.group())
else:
    print('No match found')
    

Using re.match

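# Check for a match at the beginning of the string only.
# 'world' occurs later in the text, not at position 0,
# so re.match returns None here.
pattern = r'world'
text = 'hello world'
match = re.match(pattern, text)

if match:
    print('Match found:', match.group())
else:
    print('No match found')  # This branch runs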
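
To make the difference between re.match and re.search concrete, here is a small side-by-side sketch on the same text. It also uses re.fullmatch, which is not covered above and requires the pattern to cover the entire string; the variable names are illustrative.

```python
import re

text = 'hello world'

at_start = re.match(r'hello', text)      # match object: 'hello' begins the string
not_at_start = re.match(r'world', text)  # None: 'world' is not at position 0
anywhere = re.search(r'world', text)     # match object: search scans the whole string
whole = re.fullmatch(r'hello', text)     # None: the pattern must span the entire string

print(at_start.group())   # hello
print(not_at_start)       # None
print(anywhere.span())    # (6, 11)
print(whole)              # None
```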
    

Using re.findall

# Find all non-overlapping matches in the string
pattern = r'\b\w+\b'
text = 'hello world'
matches = re.findall(pattern, text)
print(matches)  # Output: ['hello', 'world']
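
One detail that often surprises people: when the pattern contains capturing groups, re.findall returns the groups and not the full matches. A short sketch with a made-up key=value string shows the difference.

```python
import re

# Illustrative input; the key=value format is an assumption for this example.
text = 'x=1, y=22'

# No groups: each list item is the whole match.
whole_matches = re.findall(r'\w=\d+', text)

# Two groups: each list item is a tuple of the captured groups.
pairs = re.findall(r'(\w)=(\d+)', text)

print(whole_matches)  # ['x=1', 'y=22']
print(pairs)          # [('x', '1'), ('y', '22')]
```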
    

Using re.sub

# Replace matches with a replacement string
pattern = r'\bhello\b'
replacement = 'hi'
text = 'hello world'
new_text = re.sub(pattern, replacement, text)
print(new_text)  # Output: 'hi world'
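
re.sub also accepts a count argument, backreferences such as \1 in the replacement, and a function as the replacement. The sketch below illustrates these three options on sample strings chosen for this example.

```python
import re

# count=1 limits the substitution to the first match only.
first_only = re.sub(r'\bhello\b', 'hi', 'hello hello world', count=1)

# \1 and \2 in the replacement refer to the captured groups.
swapped = re.sub(r'(\w+) (\w+)', r'\2 \1', 'hello world')

# A function receives each match object and returns the replacement text.
shouted = re.sub(r'\b\w+\b', lambda m: m.group().upper(), 'hello world')

print(first_only)  # hi hello world
print(swapped)     # world hello
print(shouted)     # HELLO WORLD
```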
    

Regular Expression Syntax
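  • .: Matches any character except a newline (the re.DOTALL flag makes it match newlines too).
  • \d: Matches any digit.
  • \w: Matches any word character (alphanumeric + underscore).
  • \s: Matches any whitespace character (space, tab, newline, and similar).
  • \b: Matches a word boundary, that is, the empty position between a word character and a non-word character.
  • ^: Matches the start of the string (and the start of each line with re.MULTILINE).
  • $: Matches the end of the string, or just before a newline at the end of the string (and the end of each line with re.MULTILINE).
  • +: Matches one or more repetitions of the preceding element (a character, character class, or group).
  • *: Matches zero or more repetitions of the preceding element.
  • ?: Matches zero or one repetition of the preceding element.
  • {n}: Matches exactly n repetitions of the preceding element.
  • {n,m}: Matches between n and m repetitions of the preceding element. Write it without a space: re treats "{n, m}" as literal text, not as a quantifier.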
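
The syntax elements above can be tried out directly with re.findall and re.search. This is a small sketch; the sample string and variable names are invented for illustration.

```python
import re

# Illustrative sample string.
sample = 'Order 42: 3 apples, 15 pears'

digits = re.findall(r'\d+', sample)             # + : one or more digits
two_digit = re.findall(r'\b\d{2}\b', sample)    # {n} with word boundaries
mid_words = re.findall(r'\b\w{5,6}\b', sample)  # {n,m}: words of 5 to 6 characters
starts = bool(re.search(r'^Order', sample))     # ^ : anchored at the start
ends = bool(re.search(r'pears$', sample))       # $ : anchored at the end
optional = re.findall(r'apples?', 'apple apples')  # ? : the trailing 's' is optional

print(digits)     # ['42', '3', '15']
print(two_digit)  # ['42', '15']
print(mid_words)  # ['Order', 'apples', 'pears']
print(starts, ends)
print(optional)   # ['apple', 'apples']
```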
