Skip to main content

Regular Expressions in Python

 


Learning Sections          show

Regular Expressions in Python

Regular expressions (regex) are a powerful tool for matching patterns in text. Python provides the re module to work with regular expressions.


Basic Functions in the re Module
  • re.search(pattern, string): Searches for the first occurrence of the pattern within the string. Returns a match object if found, else None.
  • re.match(pattern, string): Checks for a match only at the beginning of the string. Returns a match object if found, else None.
  • re.findall(pattern, string): Returns a list of all non-overlapping matches of the pattern in the string.
  • re.finditer(pattern, string): Returns an iterator yielding match objects over all non-overlapping matches.
  • re.sub(pattern, repl, string): Replaces the matches with the specified replacement string.

Using re.search

import re

# Search for a pattern within a string
pattern = r'\bhello\b'
text = 'hello world'
match = re.search(pattern, text)

if match:
    print('Match found:', match.group())
else:
    print('No match found')
    

Using re.match

# Check for a match at the beginning of the string
pattern = r'world'
text = 'hello world'
match = re.match(pattern, text)

if match:
    print('Match found:', match.group())
else:
    print('No match found')
    

Using re.findall

# Find all non-overlapping matches in the string
pattern = r'\b\w+\b'
text = 'hello world'
matches = re.findall(pattern, text)
print(matches)  # Output: ['hello', 'world']
    

Using re.sub

# Replace matches with a replacement string
pattern = r'\bhello\b'
replacement = 'hi'
text = 'hello world'
new_text = re.sub(pattern, replacement, text)
print(new_text)  # Output: 'hi world'
    

Regular Expression Syntax
  • .: Matches any character except a newline.
  • \d: Matches any digit.
  • \w: Matches any word character (alphanumeric + underscore).
  • \s: Matches any whitespace character.
  • \b: Matches a word boundary.
  • ^: Matches the start of the string.
  • $: Matches the end of the string.
  • +: Matches one or more repetitions of the preceding character.
  • *: Matches zero or more repetitions of the preceding character.
  • ?: Matches zero or one repetition of the preceding character.
  • {n}: Matches exactly n repetitions of the preceding character.
  • {n, m}: Matches between n and m repetitions of the preceding character.

Popular posts from this blog

Learn Python

  Learning Sections Introduction to Python Comment, escape sequence and print statement in Python Variables and Data Types in Python Typecasting in Python User input in Python String slicing and operations on string in Python String methods in Python If else conditional statements in Python Match case statement in Python For loops in Python While loops in Python Break and continue statement in Python Functions in Python Function Arguments in Python introduction to lists in Python List methods in Python Tuples in Python Operations on tuple in Python f strings in Python Docstrings in Python Recursion in Python Sets in Python Set methods in Python Dictionaries in Python for Loop with else in Python Exception Handling in Python Finally keyword in Python Raising custom errors in Python Short hand if else statements Enumerate Function in Python Virtual Environment in Python How import works in Python if __nam...

MultiProcessing in Python

  Learning Sections          show MultiProcessing in Python Multiprocessing in Python involves using the multiprocessing module to run multiple processes concurrently, taking advantage of multiple CPU cores. This module provides a higher level of concurrency than threading and is especially useful for CPU-bound tasks. Creating Processes You can create and start a new process by using the multiprocessing module: import multiprocessing def print_numbers (): for i in range ( 10 ): print ( i ) p1 = multiprocessing.Process ( target = print_numbers ) p1 . start () p1 . join () # Wait for the process to complete Using Process Pools The multiprocessing module provides a Pool class, which allows you to manage a pool of worker processes: from multiprocessing import Pool def square ( n ): return n * n with Pool ( 4 ) as pool : result = pool.map ( square , range (...

Match Case Statement in Python

  Learning Sections      show Match-Case Statements In Python 3.10 and later, the match-case statement was introduced as a way to perform pattern matching, similar to switch-case statements in other programming languages. It allows you to check the value of a variable against multiple patterns and execute corresponding blocks of code. Basic Match-Case Statement The match statement is followed by an expression and several case clauses. Each case specifies a pattern and an action to be taken if the pattern matches the value of the expression. # Basic match-case statement example command = "start" match command : case "start" : print ( "Starting..." ) case "stop" : print ( "Stopping..." ) case "pause" : print ( "Pausing..." ) case _: print ( "Unknown command" ) Matching Multiple Patterns Using the pipe sy...