Definitions
- Algorithm - a well-defined computational procedure that takes some value or set of values as input and produces some value as output in a finite amount of time.
- Loop Invariant - a form of correctness proof that relies on mathematical induction
- Initialization
- That the invariant holds true before the first iteration of the loop
- For insertion sort: before we start, the one-element prefix is trivially sorted (see the sketch after this list)
- Maintenance
- That if the invariant is true before an iteration, it remains true before the next iteration of the for loop
- Termination
- When the loop terminates, the invariant yields the correct final result that the algorithm sought to achieve
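A minimal Python sketch of insertion sort (not from the notes; variable names are illustrative), with comments marking where the three invariant stages apply:

```python
def insertion_sort(a):
    # Initialization: before the first iteration, a[:1] is a single
    # element, which is trivially sorted.
    for i in range(1, len(a)):
        key = a[i]
        j = i - 1
        # Maintenance: shift larger elements of the sorted prefix a[:i]
        # to the right until the correct slot for key opens up.
        while j >= 0 and a[j] > key:
            a[j + 1] = a[j]
            j -= 1
        a[j + 1] = key  # a[:i+1] is sorted again
    # Termination: the loop ends with every index processed,
    # so the invariant says the entire array is sorted.
    return a
```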
- Asymptotic Analysis
- There are bounds, scaled by constant factors, that can restrict f(n) from above and below
Big O, Theta, and Omega Notation - 8/21
Time Constraint Equivalencies
| Function | Asymptotic constraint |
| --- | --- |
| log^k n | o(n^i) |
| n^k | o(c^n) |
| n^(log c) | Θ(c^(log n)) |
| log(n!) | Θ(n log n) |
| sqrt(n) vs. n^(sin n) | no relation (n^(sin n) oscillates, so neither bounds the other) |
Big O
- Some function c*g(n) that is an upper bound on f(n) for all n beyond some threshold n0
- Limit method for determining Big O
- If lim f(n)/g(n) exists and is finite (less than some constant c), then f(n) = O(g(n)) (worked example after this block)
- If g(n) is “heavier” then the ratio tends toward a constant (or zero), which means some constant times g(n) can serve as an upper bound
- It is possible that f(n) is still bounded by O(g(n)) even when the limit doesn’t exist, for example f(n) = (2 + (-1)^n)*g(n)
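A quick worked example of the limit method, with functions chosen for illustration: take f(n) = 3n^2 + n and g(n) = n^2. Then lim f(n)/g(n) = lim (3 + 1/n) = 3, which is finite, so 3n^2 + n = O(n^2). Since the limit is also nonzero, the same computation shows 3n^2 + n = Ω(n^2), hence Θ(n^2).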
Little o
- a stricter upper bound than Big O: not only is there some constant c*g(n) that upper-bounds f(n), but ANY constant times g(n) eventually upper-bounds f(n)
- f(n) = o(g(n)) if and only if lim f(n)/g(n) == 0
Big Omega ( Ω )
- f(n) = Ω(g(n)): there exists a constant c > 0 where c*g(n) <= f(n) for all n > n0
- Used to indicate the minimum runtime of the algorithm
- lower bound for the growth rate of f(n)
- limit method can also be used for Big Omega Ω
- similar to Big O, but now the limit must be greater than some constant c > 0 (possibly infinite): lim f(n)/g(n) > c
Little Omega ( ω )
- similar to Little o in its application, but a more restrictive lower bound: for ANY constant c, c*g(n) eventually holds as a lower bound for f(n)
- f(n) = ω(g(n)) if and only if lim f(n)/g(n) == ∞
- n^2 = ω(n) because for any constant c, after some n0, n^2 exceeds c*n
- lim n→∞ n^2/n == ∞
Big Theta ( Θ )
- If both f(n) = O(g(n)) AND f(n) = Ω(g(n)), then it can be said that f(n) = Θ(g(n)), which indicates that f and g have the same growth rate
Reflexivity / Symmetric / Transitivity / Transpose
- Reflexivity
- A function is always bounded above and below by itself
- Holds for O, Θ, and Ω (but not for o or ω, since f(n) is never o(f(n)))
- Symmetric
- Holds only for Θ notation: if f(n) = Θ(g(n)), then g(n) = Θ(f(n))
- Transitivity
- Holds for all the notations; e.g., if f(n) = O(g(n)) and g(n) = O(h(n)) ⇒ f(n) = O(h(n))
- Transpose
- A Big O or little o relationship between f(n) and g(n) implies a Big Omega or little omega relationship in the opposite direction, respectively: f(n) = O(g(n)) ⇔ g(n) = Ω(f(n)), and f(n) = o(g(n)) ⇔ g(n) = ω(f(n))
Function Iteration
- functionally acts like function composition, but rather than composing f with some other function g or h, we recursively compose f with itself
- Written as f ^ i where i is the iteration
- Definitionally, f^1 is just f itself
- f^2 ⇒ f(f^1(x)), or in composition syntax, (f ∘ f)(x)
Iterated Logarithm
- One function with its own iterated notation is the logarithm, written with “star notation”
- log* n = min{ i >= 0 : log^(i) n <= 1 }, where log^(i) means the log applied i times
- log(log(log(log(log(2^65536))))) = 1 (five applications: 2^65536 → 65536 → 16 → 4 → 2 → 1)
- log* 2^65536 = 5 (sketch below)
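A small Python sketch of the definition (base-2 logs assumed):

```python
import math

def log_star(n):
    """Iterated logarithm: count how many times log2 must be
    applied before the value drops to <= 1."""
    count = 0
    while n > 1:
        n = math.log2(n)
        count += 1
    return count

# log_star(2 ** 65536) == 5:  2^65536 -> 65536 -> 16 -> 4 -> 2 -> 1
```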
Divide-and-Conquer / Recurrence Relations
- Generally we want to divide the problem into many smaller problems of the same sort.
- e.g., splitting a problem of size n into two subproblems of size n/2
- Conquer each subproblem recursively
- Combine the solutions of the subproblems to solve the original problem
Merge Sort
- Split the array into two halves ( left and right)
- if a half still has more than 2 elements, repeat step 1 on it
- else sort the one or two remaining elements directly and return
- recursively merge the sorted halves
- This sorts an array in O(n log n), which is the best possible time for a comparison-based sort on arbitrary input; a sketch follows
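A compact (not in-place) Python sketch of the steps above; this version recurses down to size 1 rather than stopping at 2, but the asymptotics are identical:

```python
def merge_sort(a):
    # Base case: arrays of size 0 or 1 are already sorted.
    if len(a) <= 1:
        return a
    mid = len(a) // 2
    left, right = merge_sort(a[:mid]), merge_sort(a[mid:])
    # Merge the two sorted halves in linear time.
    merged, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i]); i += 1
        else:
            merged.append(right[j]); j += 1
    return merged + left[i:] + right[j:]
```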
Techniques for Solving Recurrences
- Recurrence Relations generally follow the form
- T(n) = aT(n/b) + f(n)
- n/b is the size of each subproblem
- a is the number of subproblems each call spawns
- f(n) is the cost of the non-recursive work (dividing and combining)
- Merge sort has the relation T(n) = 2T(n/2) + Θ(n)
Substitution Method
- guess the form of the solution
- Prove the guess using induction
- We make a hypothesis, a guess, that T(n) will be bounded by Θ (or O) of some function
- Substitute the guess in for the recursive occurrences of T
- For T(n) = 2T(n/2) + cn, guessing T(n) <= dn log n gives T(n) <= 2d(n/2)log(n/2) + cn
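Carrying that substitution through (the constants c and d are illustrative): assume inductively that T(k) <= dk log k for all k < n. Then T(n) <= 2d(n/2)log(n/2) + cn = dn(log n - 1) + cn = dn log n - (d - c)n <= dn log n whenever d >= c, so T(n) = O(n log n) by induction.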
Recursion Tree
- Draw the size and number of nodes labeled in reference to n
- The first layer has one node n, layer two has two nodes of size n/2 etc.
- the root node is the original problem
- The leaves are the base cases. For merge sort that is when a node reaches size 1
- each level's cost is the sum of the costs of its nodes
- Solve for the height of the tree
- The total work = the sum of the costs across all levels (height of the tree * work per level, when every level costs the same)
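Worked through for the merge sort recurrence T(n) = 2T(n/2) + cn (c illustrative): level i of the tree has 2^i nodes of size n/2^i, so each level costs 2^i * c(n/2^i) = cn. With log n + 1 levels, the total work is cn(log n + 1) = Θ(n log n), matching the height-times-work-per-level rule.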
Master Method
- If a < b, then log_b(a) < 1, so the recursion term n^(log_b a) is sublinear
- If b < a, then log_b(a) > 1, so n^(log_b a) grows faster than n^1 (i.e., superlinearly)
- Works in the form T(n) = aT(n / b) + O(n^d)
- There are three cases
- For all three you need to know log_b(a) [denoted as loga from here on out] and its relation to d
- d > loga
- This means the work dominates, thus T(n) = O(n^d)
- d = loga
- If d and loga are exactly equal
- T(n) = O(n^d log n)
- Balanced case: every level of the recursion does about the same amount of work
- d < loga
- This means the recursive work is dominating
- T(n) = O(n^loga)
- Steps to solve
- pull out a, b, and d
- find loga
- compare d and loga
- choose the correct case and read off the runtime (see the sketch below)
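A tiny Python sketch of those steps for the simplified T(n) = aT(n/b) + O(n^d) form (the output strings are illustrative):

```python
import math

def master_method(a, b, d):
    """Classify T(n) = a*T(n/b) + O(n^d) into one of the three cases."""
    log_b_a = math.log(a, b)  # the critical exponent, "loga"
    if d > log_b_a:
        return f"Theta(n^{d})"            # non-recursive work dominates
    if math.isclose(d, log_b_a):
        return f"Theta(n^{d} * log n)"    # balanced: every level costs the same
    return f"Theta(n^{log_b_a:g})"        # leaves dominate

# master_method(2, 2, 1) -> "Theta(n^1 * log n)"  (merge sort)
```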
Example
- Problem 1
- T(n) = 4T(n/2) + n
- Each layer has 4 subproblems each of size n/2
- n^(log_b a) = n^(log_2 4) = n^2
- f(n) = n, so d = 1 < loga = 2 and the recursive work dominates
- T(n) = Θ(n^2)
- Problem 2
- T(n) = 2T(n/4) + n
- loga = log_4(2) = 1/2 < 1 = d
- f(n) = n = Ω(n^(1/2)), so the non-recursive work dominates
- T(n) = Θ(n)
- Problem 3
- T(n) = 4T(n/2) + n^3
- n^(log_2 4) = n^2
- f(n) = n^3 = Ω(n^2), so the non-recursive work dominates (d = 3 > loga = 2)
- T(n) = Θ(f(n)) = Θ(n^3)
Maximum Subarray
- Given an array find a non-empty contiguous subarray whose values have the largest sum
- Example A = [-2,1,-1,4,-1,2,1,-5,4]
- Maximum subarray B = [4,-1,2,1]
- Brute-Force Approach
- With a nested for loop, try every possible subarray while keeping a running sum, tracking the largest sum seen; this runs in O(n^2)
- Divide and Conquer Solution
- Divide the array into two halves, A[1:mid] and A[mid+1:A.length]
- The maximum subarray is either
- Entirely in first half
- Entirely in second half
- Or crossing the midpoint
- The first two cases are handled by the recursive calls themselves
- For the 3rd case we need to use a find-max-crossing-subarray function
- We have two sums, one for the left and one for the right, both initialized to -inf
- starting from mid, we go left and right respectively, updating the current and max sum for each side
- max_left and max_right record the boundary indices of the best crossing subarray; the combined left + right sum is what gets compared against the other two cases
- O(n)
- Compare the three cases to find the overall maximum (the crossing step is sketched below)
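A Python sketch of the crossing-subarray step described above (CLRS-style inclusive indices; this version returns just the sum, omitting the boundary indices for brevity):

```python
def max_crossing_sum(a, lo, mid, hi):
    """Best sum over subarrays a[i..j] with i <= mid < j."""
    left_sum, total = float("-inf"), 0
    for i in range(mid, lo - 1, -1):   # grow leftward from mid
        total += a[i]
        left_sum = max(left_sum, total)
    right_sum, total = float("-inf"), 0
    for j in range(mid + 1, hi + 1):   # grow rightward from mid + 1
        total += a[j]
        right_sum = max(right_sum, total)
    return left_sum + right_sum        # best crossing subarray, O(n)
```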
Quicksort Algorithm
- Has a worst case of O(n^2), but an expected (average-case) runtime of O(n log n)
- The reason you would use this over merge sort, which is guaranteed O(n log n), is that quicksort sorts in place.
- It's possible to choose a pivot, get unlucky, and have all the elements end up on one side, pushing the time toward O(n^2)
- Choosing the Pivot
- We can choose an arbitrary position such as the first or last position
- We can choose a random index
- This has an impact on the runtime: a good pivot → O(n log n), and a poor choice → O(n^2)
- Partition function
- i starts just before the subarray and j scans from the first element to r-1; if A[j] is less than or equal to the pivot A[r], i advances and A[i] is swapped with A[j]; otherwise only j advances
- Elements greater than the pivot accumulate between i and j as j moves forward (see the sketch after the example below)
- Order of operations
- split the array into two partitions using some pivot value
- recursively call quicksort on each partition
- In the partition function, we put all elements less than the pivot in the left subarray, and all greater than the pivot in the right subarray
- the pivot is guaranteed to be in its final position at the end (all elements to the left are smaller, all to the right are larger)
- Recurrence Relation (best case, with balanced partitions)
- T(n) = 2T(n/2) + n
- Case 2 of the master method
- f(n) = n = Θ(n^(log_b a))
- T(n) = Θ(n log n)
Example
- Array = [2,8,7,1,3,5,6,4]
- pivot: 4 (arbitrary, but we are choosing the last element)
- i starts just before the array, j scans from index 0
- for each element x = A[j]: if x < 4, advance i and swap A[i] with A[j]
- else just advance j, leaving x in the greater-than-pivot region
- after every element is checked, swap A[i+1] with the pivot so 4 lands in its final sorted position
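A Python sketch of quicksort with the Lomuto-style partition walked through above (last element as pivot):

```python
def partition(a, p, r):
    """Place pivot a[r] into its final position; everything at or
    below index i stays <= pivot as j scans forward."""
    pivot = a[r]
    i = p - 1
    for j in range(p, r):
        if a[j] <= pivot:
            i += 1
            a[i], a[j] = a[j], a[i]
    a[i + 1], a[r] = a[r], a[i + 1]    # pivot lands at index i + 1
    return i + 1

def quicksort(a, p, r):
    if p < r:
        q = partition(a, p, r)   # pivot is now in its final position
        quicksort(a, p, q - 1)   # sort the "less than" side
        quicksort(a, q + 1, r)   # sort the "greater than" side

# a = [2, 8, 7, 1, 3, 5, 6, 4]; quicksort(a, 0, len(a) - 1)
```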
Selection Problem
- Given some unsorted array A, find the i-th smallest element of A
- We could first sort the array with merge sort or quicksort and then just index A[i], which solves the problem in O(n log n)
Quickselect
- A recursive algorithm that gives us the i-th smallest element, and only that element, in expected O(n), which is much faster than the naive approach
- Using the same partition function from above, recurse only into the side of the split that must contain the answer (sketch after the Median of Medians steps below)
- Can use the Median of Medians to find a good pivot
- Guarantees that at least ~60% of the array (roughly 30% on each side) lands in the correct subarray relative to the pivot, so each recursive call works on at most ~70% of the elements
- break array into groups of 5 (A[0:4], A[5:9], A[10:14])
- if the last group has < 5 elements because n % 5 != 0, we just take the median of whatever elements remain
- find the median of each subarray
- Find the median value of the medians calculated in the previous step. This is your pivot
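A Python sketch of quickselect using a random pivot; the median-of-medians pivot above would make the worst case linear, while this version is expected O(n) only. Written out-of-place for clarity:

```python
import random

def quickselect(a, i):
    """Return the i-th smallest element of a (0-indexed)."""
    if len(a) == 1:
        return a[0]
    pivot = random.choice(a)
    less = [x for x in a if x < pivot]
    equal = [x for x in a if x == pivot]
    greater = [x for x in a if x > pivot]
    if i < len(less):                   # answer lies in the left part
        return quickselect(less, i)
    if i < len(less) + len(equal):      # the pivot itself is the answer
        return pivot
    return quickselect(greater, i - len(less) - len(equal))
```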
Why do we use groups of 5?
- With larger and larger groups, finding each group's median approaches the cost of simply sorting the array with merge sort, which defeats the purpose
- With even group sizes, the group median would be the average of two elements, which isn't an element of the array and isn't very indicative of the true median
- With groups of 3, the recurrence becomes T(n) <= T(n/3) + T(2n/3) + O(n), whose subproblem sizes sum to n, so it solves to Θ(n log n) instead of the O(n) we get from groups of 5
Decision Tree for Sorting
A decision tree is a binary tree that represents the sequence of comparisons a sorting algorithm performs
- Each node is labeled with a comparison i:j
- each leaf node represents a permutation of the array
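This tree is what gives the comparison-sorting lower bound (a standard result, stated here for context): a correct algorithm's tree needs at least n! leaves, one per permutation, and a binary tree of height h has at most 2^h leaves, so h >= log(n!) = Θ(n log n). Every comparison sort therefore makes Ω(n log n) comparisons in the worst case, matching the log(n!) row in the table above.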
Dynamic Programming
What is Dynamic Programming?
- An approach to solving a problem where some subproblems are repeated or reused.
- For example: each Fibonacci number relies on the previous values, so computing f(5) and subsequently f(6) would each recompute f(1) through f(4)
- Fibonacci
- Naive approach is the recursive call where we return Fibonacci(n-1) + Fibonacci(n-2)
- this has a runtime that is exponential (Φ^n)
- Memoization approach - starting from F0 we store any values we compute, and in future computations we pull from the cached values in O(1) (sketch below)
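A minimal Python sketch of the memoized approach (using the standard library cache rather than a hand-rolled table):

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def fib(n):
    """Each fib(k) is computed once and cached, so later calls
    return in O(1); total work drops from Phi^n to O(n) calls."""
    if n < 2:
        return n
    return fib(n - 1) + fib(n - 2)
```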
Subsequences
What is a subsequence?
- Some sequence X is considered a subsequence of another sequence Y if all the elements of X can be found in sequence Y in the same order (not necessarily contiguously)
- X = [A, D, C, A] Y = [ B, A, C, D, A, C, A]
- X is a subsequence of Y
- Increasing Subsequence
- An increasing subsequence is a subsequence Z that strictly increases from element to element
- Ex. Z = [1,2,4,6]
- Anti-Example Z =[1,2,4,3]
Longest Increasing Subsequence
- Given some sequence X, what is the longest increasing subsequence?
- We are concerned with the length of the subsequence, not which particular values or indices achieve it.
- If we want to find the LIS ending at index i = 6, then every element we choose from the earlier interval must be less than X[6].
- Using those eligible indices, we recursively compute the LIS ending at each one. The LIS ending at our index is the max of these subproblems + 1 (see the sketch after the example)
Example
- X = [7,2,8,1,3,4,10,6,9,5]
- Find the LIS ending at X[5]
- Every element we pick before it must be < X[5] = 4
- Find the LIS of {X[1], X[3], X[4]}
- { 1, 1, 2}
- Max = 2
- LIS = Max(sub-LIS) + 1 = 3
- {1, 3, 4}
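A Python sketch of the O(n^2) DP described above, where lengths[i] is the LIS ending at index i:

```python
def lis_length(x):
    lengths = [1] * len(x)           # every element alone is an LIS of 1
    for i in range(len(x)):
        for j in range(i):
            if x[j] < x[i]:          # x[j] may precede x[i]
                lengths[i] = max(lengths[i], lengths[j] + 1)
    return max(lengths)

# lis_length([7, 2, 8, 1, 3, 4, 10, 6, 9, 5]) == 5, e.g. 2, 3, 4, 6, 9
```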
Longest Common Subsequence
Definitions
Given two sequences X and Y, we say that a sequence Z is a common subsequence of X and Y if it is a subsequence of both X and Y.
- Prefix: any contiguous substring starting from the first element of a string or sequence
- Suffix: any contiguous substring that ends with the final element of a string or sequence
Example
- X = ABCBDAAB, Y = BDCABA
- The sequence BCA is a common subsequence of X and Y
- BCBA is also a common subsequence of them
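The notes stop at the definitions, but the standard prefix-based DP makes them concrete. A sketch computing just the LCS length, where c[i][j] is the LCS of the prefixes X[:i] and Y[:j]:

```python
def lcs_length(x, y):
    m, n = len(x), len(y)
    c = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            if x[i - 1] == y[j - 1]:            # matching tail characters
                c[i][j] = c[i - 1][j - 1] + 1
            else:                               # drop one tail character
                c[i][j] = max(c[i - 1][j], c[i][j - 1])
    return c[m][n]

# lcs_length("ABCBDAAB", "BDCABA") == 4  (e.g. BCBA)
```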
Dynamic Programming Examples
- Chain Matrix Multiplication
- When multiplying a chain of matrices, different order of operations in terms of the associativity of the multiplication will have vastly different run times.
- That is, in terms of the number of scalar multiplications needed (and thus total runtime), A1(A2*A3) != (A1*A2)A3 in cost, even though the resulting product is the same
- Given some number of matrices, we need to find the ordering (parenthesization) that results in the least number of multiplications
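A worked cost example with illustrative dimensions: multiplying an a×b matrix by a b×c matrix takes a*b*c scalar multiplications. For A1 (10×100), A2 (100×5), A3 (5×50):
- (A1*A2)A3 costs 10*100*5 + 10*5*50 = 7,500 multiplications
- A1(A2*A3) costs 100*5*50 + 10*100*50 = 75,000 multiplications
The same product, a 10x difference in work, which is exactly what the ordering problem optimizes.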