
Regular Expressions in Python

 




Regular expressions (regex) are a powerful tool for matching patterns in text. Python provides the re module to work with regular expressions.


Basic Functions in the re Module
  • re.search(pattern, string): Searches for the first occurrence of the pattern within the string. Returns a match object if found, else None.
  • re.match(pattern, string): Checks for a match only at the beginning of the string. Returns a match object if found, else None.
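  • re.findall(pattern, string): Returns a list of all non-overlapping matches of the pattern in the string. If the pattern contains capturing groups, the list holds the group contents instead of the whole matches.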
  • re.finditer(pattern, string): Returns an iterator yielding match objects over all non-overlapping matches.
  • re.sub(pattern, repl, string): Returns a new string in which every non-overlapping match of the pattern is replaced by repl. The original string is not modified.
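
re.finditer is the one function in the list above that has no example of its own further down, so here is a minimal sketch. The sample text and pattern are made up for illustration only.

```python
import re

# Hypothetical sample text, chosen only for illustration
text = 'cat bat rat'

# finditer yields one match object per non-overlapping match,
# so the position of each match is available as well as its text
positions = [(m.group(), m.start(), m.end())
             for m in re.finditer(r'\b\w+at\b', text)]

print(positions)  # Output: [('cat', 0, 3), ('bat', 4, 7), ('rat', 8, 11)]
```

Compared with re.findall, this is useful when you need to know where each match starts and ends, or when the input is large and you prefer to process matches one at a time.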

Using re.search

import re

# Search for a pattern within a string
pattern = r'\bhello\b'
text = 'hello world'
match = re.search(pattern, text)

if match:
    print('Match found:', match.group())  # Output: Match found: hello
else:
    print('No match found')
    

Using re.match

# Check for a match at the beginning of the string
pattern = r'world'
text = 'hello world'
match = re.match(pattern, text)  # None: 'world' is not at the start of text

if match:
    print('Match found:', match.group())
else:
    print('No match found')  # Output: No match found
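
To make the difference between re.match and re.search concrete, the following sketch runs both on the same text. The variable names are illustrative, not part of the re module.

```python
import re

text = 'hello world'

# re.match only tries position 0, so 'world' is not found
at_start = re.match(r'world', text)

# re.search scans the whole string and finds 'world' later on
anywhere = re.search(r'world', text)

# re.match succeeds when the pattern really does begin the string
greeting = re.match(r'hello', text)

print(at_start)          # Output: None
print(anywhere.span())   # Output: (6, 11)
print(greeting.span())   # Output: (0, 5)
```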
    

Using re.findall

# Find all non-overlapping matches in the string
pattern = r'\b\w+\b'
text = 'hello world'
matches = re.findall(pattern, text)
print(matches)  # Output: ['hello', 'world']
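
One detail that often surprises people is how re.findall behaves when the pattern has capturing groups. The sketch below uses a made-up input string to show the three cases.

```python
import re

# Hypothetical input, used only to illustrate the return shapes
text = 'x=1, y=22'

# No groups: each item is the whole match
whole = re.findall(r'\w=\d+', text)

# One group: each item is the text of that group only
values = re.findall(r'\w=(\d+)', text)

# Two or more groups: each item is a tuple of group texts
pairs = re.findall(r'(\w)=(\d+)', text)

print(whole)   # Output: ['x=1', 'y=22']
print(values)  # Output: ['1', '22']
print(pairs)   # Output: [('x', '1'), ('y', '22')]
```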
    

Using re.sub

# Replace matches with a replacement string
pattern = r'\bhello\b'
replacement = 'hi'
text = 'hello world'
new_text = re.sub(pattern, replacement, text)
print(new_text)  # Output: hi world
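
re.sub also accepts a count argument, group references in the replacement, and a function as the replacement. The sketch below shows each of these on small made-up strings.

```python
import re

# count=1 limits the replacement to the first match
first_only = re.sub(r'\bhello\b', 'hi', 'hello hello world', count=1)

# \1 and \2 in the replacement refer to the captured groups
swapped = re.sub(r'(\w+) (\w+)', r'\2 \1', 'hello world')

# The replacement can be a function that receives the match object
shouted = re.sub(r'\bhello\b', lambda m: m.group().upper(), 'hello world')

print(first_only)  # Output: hi hello world
print(swapped)     # Output: world hello
print(shouted)     # Output: HELLO world
```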
    

Regular Expression Syntax
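  • .: Matches any character except a newline (with the re.DOTALL flag it matches newlines too).
  • \d: Matches any decimal digit.
  • \w: Matches any word character (letters, digits, and underscore).
  • \s: Matches any whitespace character (space, tab, newline, and so on).
  • \b: Matches a word boundary, the empty position between a word character and a non-word character. Write it in a raw string such as r'\bhello\b', because '\b' in an ordinary string is a backspace.
  • ^: Matches the start of the string (and the start of each line with re.MULTILINE).
  • $: Matches the end of the string, or just before a newline at the end of the string (and the end of each line with re.MULTILINE).
  • +: Matches one or more repetitions of the preceding element (a character, character class, or group).
  • *: Matches zero or more repetitions of the preceding element.
  • ?: Matches zero or one repetition of the preceding element.
  • {n}: Matches exactly n repetitions of the preceding element.
  • {n,m}: Matches between n and m repetitions of the preceding element, inclusive. Do not put a space after the comma: {n, m} is treated as literal text, not as a quantifier.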
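
The sketch below tries several of the symbols above on a made-up sample string. The variable names are illustrative only.

```python
import re

# Hypothetical sample text, chosen only for illustration
sample = 'Order 66: 3 apples, 12 pears'

digits = re.findall(r'\d+', sample)               # runs of one or more digits
two_digit = re.findall(r'\b\d{2}\b', sample)      # exactly two digits, on word boundaries
starts = re.search(r'^Order', sample) is not None   # anchored at the start
ends = re.search(r'pears$', sample) is not None     # anchored at the end
optional = re.findall(r'apples?', 'apple apples')   # the final 's' is optional

# {2,3} is a quantifier; with a space after the comma it is literal text
ranged = re.search(r'^a{2,3}$', 'aaa') is not None
spaced = re.search(r'^a{2, 3}$', 'aaa') is not None

print(digits)     # Output: ['66', '3', '12']
print(two_digit)  # Output: ['66', '12']
print(starts, ends)  # Output: True True
print(optional)   # Output: ['apple', 'apples']
print(ranged, spaced)  # Output: True False
```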
